Part of Paper No. 0



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 16, 2004, 09:36:27 ; Search time 42 Seconds

(without alignments) 
861.659 Million cell updates/sec 

Title: US-09-843-221A-2 
Perfect score: 1238 

Sequence: 1 MDKTHTCPPCPAPELLGGPS MHEALHNHYTQKSLSLSPGK 228 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 
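
The "Perfect score" above can be cross-checked as the score of the query aligned to itself under BLOSUM62. The sketch below is illustrative only: the query string is reassembled from the alignment rows later in this report, on the assumption that "VW" and "RW" in the scanned rows are damaged "VVV" and "RVV", and the names `BLOSUM62_DIAGONAL`, `QUERY` and `perfect_score` are hypothetical.

```python
# BLOSUM62 diagonal (self-substitution) scores for the 20 standard amino acids.
BLOSUM62_DIAGONAL = {
    "A": 4, "R": 5, "N": 6, "D": 6, "C": 9, "Q": 5, "E": 5, "G": 6,
    "H": 8, "I": 4, "L": 4, "K": 5, "M": 5, "F": 6, "P": 7, "S": 4,
    "T": 5, "W": 11, "Y": 7, "V": 4,
}

# Query reassembled from the four alignment rows of the report.
# Assumption: "VW"/"RW" in the scanned rows stand for "VVV"/"RVV".
QUERY = (
    "MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV"
    "DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA"
    "KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD"
    "SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK"
)


def perfect_score(sequence: str) -> int:
    """Return the gap-free score of a sequence aligned to itself."""
    return sum(BLOSUM62_DIAGONAL[residue] for residue in sequence)


if __name__ == "__main__":
    # Prints the query length and its self-score.
    print(len(QUERY), perfect_score(QUERY))  # 228 1238
```

A self-alignment contains no gaps, so the gap penalties (Gapop, Gapext) do not enter this figure.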



Searched : 



1107863 seqs, 158726573 residues 
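
The reported throughput appears to be consistent with one dynamic-programming cell per query-residue/database-residue pair. A minimal sketch of that arithmetic follows; the interpretation of "cell updates" is an assumption, and the function name is hypothetical.

```python
QUERY_LENGTH = 228               # residues in the query (1..228)
DATABASE_RESIDUES = 158_726_573  # "158726573 residues"
SEARCH_SECONDS = 42              # "Search time 42 Seconds"


def cell_updates_per_second(query_len: int, db_residues: int, seconds: float) -> float:
    """Assumed definition: every query residue is compared with every database residue."""
    return query_len * db_residues / seconds


rate = cell_updates_per_second(QUERY_LENGTH, DATABASE_RESIDUES, SEARCH_SECONDS)

if __name__ == "__main__":
    print(f"{rate / 1e6:.3f} Million cell updates/sec")  # 861.659 Million cell updates/sec
```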



Total number of hits satisfying chosen parameters: 1107863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : A_Geneseq_19Jun03 : * 

1: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1980.DAT:*
2: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1981.DAT:*
3: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1982.DAT:*
4: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1983.DAT:*
5: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1984.DAT:*
6: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1985.DAT:*
7: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1986.DAT:*
8: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1987.DAT:*
9: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1988.DAT:*
10: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1989.DAT:*
11: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1990.DAT:*
12: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1991.DAT:*
13: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1992.DAT:*
14: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1993.DAT:*
15: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1994.DAT:*
16: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1995.DAT:*
17: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1996.DAT:*
18: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1997.DAT:*
19: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1998.DAT:*
20: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA1999.DAT:*
21: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2000.DAT:*
22: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2001.DAT:*
23: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2002.DAT:*
24: /SIDS1/gcgdata/geneseq/geneseqp-embl/AA2003.DAT:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
ALIGNMENTS



RESULT 1 
AAB16955 

ID AAB16955 standard; Protein; 228 AA. 
XX 

AC AAB16955; 
XX 

DT 31-OCT-2000 (first entry) 
XX 

DE Human IgG1 Fc protein sequence SEQ ID NO: 2.
XX 

KW Modified peptide; therapeutic agent; fusion; Fc domain; cancer; 

KW autoimmune disease; cytostatic; antiasthmatic; thrombolytic; VEGF; 

KW immunosuppressive; EPO; TPO; CTLA4; mimetic; IL-1; TNF; antagonist;

KW MMP; inhibitor; erythropoietin; thrombopoietin; interleukin 1; 

KW cytotoxic T cell lymphocyte antigen 4; tumour necrosis factor; 

KW vascular endothelial growth factor; matrix metalloproteinase; 

KW asthma; thrombosis; pharmaceutical. 
XX 

OS Homo sapiens.
XX 

PN WO200024782-A2. 
XX 

PD 04-MAY-2000. 
XX 

PF 25-OCT-1999; 99WO-US25044.
XX

PR 23-OCT-1998; 98US-0105371.

PR 22-OCT-1999; 99US-0428082.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham J, Boone TC; 
XX 

DR WPI; 2000-350702/30. 

DR N-PSDB; AAA69443.
XX 

PT Novel composition of matter comprising an Fc domain and 

PT pharmacologically active peptides, useful for treating cancer and 

PT autoimmune diseases - 

XX 

PS Claim 7; Page 176-177; 608pp; English.
XX 

CC The present invention describes composition of matter (I) comprising an 

CC Fc domain, pharmacologically active peptides, and linkers. Where (I) is: 

CC (X1)a-F1-(X2)b, where: F1 = an Fc domain; X1 and X2 = are each

CC independently selected from -(L1)c-P1, -(L1)c-P1-(L2)d-P2,

CC -(L1)c-P1-(L2)d-P2-(L3)e-P3, or -(L1)c-P1-(L2)d-P2-(L3)e-P3-(L4)f-P4

CC where P1, P2, P3, and P4 = are each independently sequences of

CC pharmacologically active peptides; L1, L2, L3, and L4 = are each

CC independently linkers; and a, b, c, d, e, and f = are each independently 

CC 0 or 1, provided that at least 1 of a and b is 1. The composition can 

CC have cytostatic, antiasthmatic, thrombolytic and immunosuppressive 

CC activities. DNAs, vectors and host cells from the present invention can

CC be used for producing pharmaceutical compositions. The compositions are 

CC useful for treating cancer, asthma, thrombosis, or autoimmune diseases. 

CC The use of an Fc domain (rather than a Fab domain) can provide a longer 

CC half-life or incorporate functions such as Fc receptor binding, protein

CC A binding, complement fixation, and possibly placental transfer. AAA69443

CC to AAA69526 and AAB16955 to AAB18003 represent nucleotide and amino acid 

CC sequences used in the exemplification of the present invention. 
XX 

SQ Sequence 228 AA; 

Query Match 100.0%; Score 1238; DB 21; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
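
The per-hit statistics (Query Match, Matches, Mismatches, Indels) can be approximated from a pair of aligned rows. The sketch below rests on two assumptions that the report itself does not spell out: that "Query Match" is the hit score as a percentage of the perfect score, and that gaps are written as "-". It does not separate conservative substitutions from other mismatches, since that would need the full BLOSUM62 table. All function names are hypothetical.

```python
def alignment_counts(qy: str, db: str, gap: str = "-") -> dict:
    """Count identical positions, mismatches and gap positions in two aligned rows."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have equal length")
    # A position is a match only when both residues agree and neither is a gap.
    matches = sum(a == b and a != gap for a, b in zip(qy, db))
    # A position is an indel when either row carries a gap symbol.
    indels = sum(a == gap or b == gap for a, b in zip(qy, db))
    mismatches = len(qy) - matches - indels
    return {"matches": matches, "mismatches": mismatches,
            "indels": indels, "length": len(qy)}


def percent(part: float, whole: float) -> float:
    """Express part as a percentage of whole."""
    return 100.0 * part / whole


if __name__ == "__main__":
    row = "SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK"  # Qy/Db 181..228
    counts = alignment_counts(row, row)
    print(counts["matches"], counts["mismatches"], counts["indels"])
    # Assumed definition of "Query Match": score relative to the perfect score.
    print(f"{percent(1238, 1238):.1f}%")
```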



RESULT 2 
AAY96529 

ID AAY96529 standard; Protein; 228 AA. 
XX 

AC AAY96529; 
XX 

DT 04-SEP-2000 (first entry) 
XX 

DE Human IgG1 Fc chain.
XX 

KW Immunoglobulin; IgG1; Fc; thrombopoietin; mimetic; TMP; TPO; platelet;
KW megakaryocyte; production; anti-human immunodeficiency virus; anti-HIV;
KW anti-anaemic; dermatological; immunosuppressive; anti-inflammatory.
XX 

OS Homo sapiens. 
XX 

PN WO200024770-A2. 
XX 

PD 04-MAY-2000. 
XX 

PF 22-OCT-1999; 99WO-US24834.
XX

PR 23-OCT-1998; 98US-0105348.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Liu C, Feige U, Cheetham J; 
XX 

DR WPI; 2000-365108/31. 
DR N-PSDB; AAA29220. 



XX 

PT Thrombopoietic peptides which activate mpl receptors and increase the 

PT production of platelets or platelet precursors, useful for treatment of 

PT diseases which involve thrombocytopenia
XX 

PS Disclosure; Page 76-77; 91pp; English. 
XX 

CC A compound which binds to an mpl receptor comprising a thrombopoietin 

CC mimetic peptide (TMP) dimer joined by a linker [TMP_1-(L_1)_nTMP_2],

CC is new. TMP_1 and TMP_2 are amino acid sequences varying from at least

CC 10 to 14 residues in length comprising X_2-X_1_0, X_2-X_1_1, X_2-X_1_2,

CC X_2-X_1_3, X_2-X_1_4, X_1-X_1_0, X_1-X_1_1, X_1-X_1_2, X_1-X_1_3, and

CC X_1-X_1_4. X_1 = I, A, V, L, S or R; X_2 = E, D, K or V; X_3 = G or A;

CC X_4 = P; X_5 = T or S; X_6 = L, I, V, A or F; X_7 = R or K; X_8 = Q, N,

CC or E; X_9 = W, Y or F; X_1_0 = L, I, V, A, F, M, or K; X_1_1 = A, I, V,

CC L, F, S, T, K, H, or E; X_1_2 = A, I, V, L, F, G, S, or Q; X_1_3 = R, K,

CC T, V, N, Q or G; X_1_4 = A, I, V, L, F, T, R, E, or G; L_1 = linker

CC comprising 1 to 20 amino acids; and n = 0 or 1. The compounds bind to and

CC activate the c-Mpl receptor which mediates the activity of endogenous 

CC thrombopoietin. The TMPs are useful for increasing the production of 

CC platelets or platelet precursors (e.g. megakaryocytes) in a mammal, which 

CC is useful for treatment of diseases which involve thrombocytopenia, e.g.

CC aplastic anaemia, immune thrombocytopenia (ITP), human immunodeficiency

CC virus associated ITP, and systemic lupus erythematosus. 
XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 21; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 3 
AAB98953 

ID AAB98953 standard; Protein; 228 AA. 
XX 

AC AAB98953; 
XX 

DT 14-AUG-2001 (first entry) 
XX 

DE Human IgG1 Fc region.



XX 

KW Human; IgG1; immunoglobulin; Fc region; Fc fusion protein;

KW misfolding; therapy; cancer; osteoarthritis; AIDS; obesity; 

KW inflammation; transplant rejection. 
XX 

OS Homo sapiens. 
XX 

PN WO200134638-A1. 
XX 

PD 17-MAY-2001. 
XX 

PF 10-NOV-2000; 2000WO-US30798.
XX

PR 12-NOV-1999; 99US-0165188.

PR 09-NOV-2000; 2000US-0709704.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Treuheit MJ, O'Conner SR, Kosky AA;
XX 

DR WPI; 2001-335908/35. 

DR N-PSDB; AAH25762.
XX 

PT Correcting disulfide bond misfolds in Fc-containing proteins, 

PT particularly therapeutic Fc-containing fusion proteins or antibodies, 

PT by treatment with copper halide 

XX 

PS Claim 30; Fig 5; 59pp; English. 
XX 

CC The present invention describes a process for preparing a 

CC pharmacologically active compound, involving preparing a compound 

CC comprising an immunoglobulin Fc domain fused to a protein of interest, 

CC treating the compound with a copper (II) halide and isolating the treated 

CC molecule. This can be used to correct misfolding of Fc domain containing

CC proteins, for use in therapeutic agents which may be used in the 

CC treatment of cancer, inflammation, transplant rejection, AIDS, 

CC osteoarthritis and obesity. The present sequence is the IgG1 Fc domain.

XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 22; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228


RESULT 4 
AAU81074 

ID AAU81074 standard; Protein; 228 AA. 
XX 

AC AAU81074; 
XX 

DT 09-APR-2002 (first entry) 
XX 

DE Human IgG1 Fc.
XX 

KW Human; IgG Fc; anticoagulant; thrombolytic; cytostatic;

KW antiinflammatory; immunosuppressive; osteopathic; antagonist;

KW laminin; saw-scaled viper; echistatin; integrin; selectin; vinculin;

KW platelet aggregation; angiogenesis; tumour; inflammation;

KW autoimmune disease; rheumatoid arthritis; osteoporosis. 

XX 

OS Homo sapiens. 
XX 

PN WO200181377-A2.
XX 

PD 01-NOV-2001. 
XX 

PF 23-APR-2001; 2001WO-US13069.
XX

PR 21-APR-2000; 2000US-198919P.

PR 03-MAY-2000; 2000US-201394P.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Kohno T, Lacey DL, Boone TC; 
XX 

DR WPI; 2002-062025/08. 

DR N-PSDB; ABK24097. 
XX 

PT Composition comprising integrin or adhesion antagonistic peptide and 

PT vehicle, useful for treating or preventing platelet aggregation, has a 

PT longer half-life than free peptide - 
XX 

PS Claim 9; Fig 3; 68pp; English.
XX 

CC The invention relates to a composition comprising an integrin/adhesion 

CC antagonistic peptide (I) and a vehicle e.g. IgG Fc. The peptides 

CC are based on laminin or saw-scaled viper echistatin and target integrin, 

CC selectin or vinculin. Also included are compounds of formula (Ia) and

CC their multimers (X^1)_a-F^1-(X^2)_b where;

CC F^1 = Fc domain;

CC X^1 and X^2 = -(L^1)_c-P^1, (L^1)_c-P^1-(L^2)_d-P^2,

CC (L^1)_c-P^1-(L^2)_d-P^2-(L^3)_e-P^3 or

CC (L^1)_c-P^1-(L^2)_d-P^2-(L^3)_e-P^3-(L^4)_f-P^4;

CC P^1-P^4 = same or different (I);

CC L^1-L^4 = same or different linkers;

CC a-f = 0 or 1, provided at least one of a and b = 1,

CC a nucleic acid that encodes (Ia), an expression vector containing the



CC nucleic acid, host cells containing the vector, producing a 

CC pharmaceutically active compound (B) by covalently linking at least one

CC Fc domain to at least one amino acid sequence of a selected randomized

CC (I) and any of six laminin-related peptides (Ib). The compositions are

CC used prophylactically and therapeutically in the same way as (I), e.g. to

CC inhibit platelet aggregation or angiogenesis (tumours), or to treat

CC inflammation and autoimmune diseases (e.g. rheumatoid arthritis) and many

CC different forms of osteoporosis, also for diagnosis. Attaching the

CC vehicle (especially Fc domain) to (I) increases the half-life (free (I)

CC are normally degraded very quickly in vivo). The present sequence

CC is human IgG1 Fc which is used as a vehicle for the antagonists of

CC the invention. 

XX 

SQ Sequence 228 AA; 

Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228

RESULT 5 
ABB73410 

ID ABB73410 standard; Protein; 228 AA. 
XX 

AC ABB73410; 
XX 

DT 05-APR-2002 (first entry) 
XX 

DE Human immunoglobulin G1 Fc (IgG1 Fc) amino acid SEQ ID NO:2.
XX 

KW Modified peptide; mimetic; Fc domain; fusion; immunoglobulin G; IgG;

KW EPO; erythropoietin; TPO; tumour necrosis factor alpha inhibitor; 

KW TNF-alpha inhibitor; interleukin 1 antagonist; IL-1 antagonist; TMP; 

KW TPO mimetic peptide; EPO mimetic peptide; EMP; VEGF antagonist; 

KW MMP inhibitor; antiinflammatory; antitumour; immunosuppressive; 

KW cytostatic; antirheumatic; antiarthritic; antidiabetic; ophthalmological;

KW antianaemic; anorectic; antiinfertility; haemostatic; dermatological;

KW neuroprotective; inflammatory disease; autoimmune disease; tumour growth; 

KW cancer; rheumatoid arthritis; diabetic retinopathy; infertility; obesity; 

KW sleep disorder; neurological degenerative disease; anaemia; 

KW thrombocytopaenia; metastatic tumour; systemic lupus erythematosus; 



KW Fanconi's syndrome.
XX 

OS Homo sapiens. 
XX 

PN WO200183525-A2. 
XX 

PD 08-NOV-2001. 
XX 

PF 02-MAY-2001; 2001WO-US14310.
XX

PR 03-MAY-2000; 2000US-0563286.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham JC, Boone TC, Gudas JM; 
XX 

DR WPI; 2002-130313/17. 

DR N-PSDB; ABL35760. 
XX 

PT Novel vehicle-peptide molecule or its multimers useful for treating 

PT inflammatory and autoimmune diseases, cancer, rheumatoid arthritis, 

PT diabetic retinopathy, obesity, sleep disorders and infertility - 
XX 

PS Claim 7; Fig 4; 176pp; English. 
XX 

CC The present invention describes a vehicle-peptide molecule (I) or its 

CC multimers. (I) can have antiinflammatory, antitumour, immunosuppressive, 

CC cytostatic, antirheumatic, antiarthritic, antidiabetic, ophthalmological,

CC antianaemic, anorectic, antiinfertility, haemostatic, dermatological and

CC neuroprotective activities. (I) can be used as a therapeutic or 

CC prophylactic agent as well as for screening purposes. (I) is useful for 

CC diagnosing diseases characterised by dysfunction of their associated 

CC protein of interest, for identifying normal or abnormal proteins of 

CC interest, as a part of diagnostic kit to detect the presence of their 

CC proteins of interest in a biological sample. Additionally, (I) is useful 

CC for treating inflammatory and autoimmune diseases, tumour growth, cancer, 

CC rheumatoid arthritis, diabetic retinopathy, obesity, sleep disorders, 

CC infertility, and neurological degenerative diseases. (I), comprising 

CC EPO-mimetic compounds are useful for treating disorders characterised by 

CC low red blood cell levels such as anaemia. The TPO-mimetic comprising 

CC compounds are useful for treating conditions that involve an existing 

CC megakaryocyte/platelet deficiency or an expected megakaryocyte/platelet 

CC deficiency, such as thrombocytopaenia, aplastic anaemia, metastatic 

CC tumour which result in thrombocytopaenia, systemic lupus erythematosus, 

CC and Fanconi's syndrome. ABB72403 to ABB73426 and ABL35695 to ABL35777

CC represent amino acid and nucleic acid sequences used in the 

CC exemplification of the present invention. 

XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 6 
AAE14310 

ID AAE14310 standard; Protein; 228 AA. 
XX 

AC AAE14310; 
XX 

DT 07-MAR-2002 (first entry) 
XX 

DE Human immunoglobulin G (IgG1) Fc.
XX 

KW Human; calcitonin; CT; CT receptor; Fc domain; therapy; osteoporosis; 

KW immunoglobulin G; IgG; osteopathic. 

XX 

OS Homo sapiens. 
XX 

PN WO200183526-A2.
XX 

PD 08-NOV-2001. 
XX 

PF 03-MAY-2001; 2001WO-US14320.
XX

PR 03-MAY-2000; 2000US-201511P.
PR 02-MAY-2001; 2001US-0847712.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Liu C, Marshall WS, Reynolds A; 
XX 

DR WPI; 2002-034503/04. 
DR N-PSDB; AAD23840. 
XX 

PT Compositions comprising Calcitonin receptor modulator domains, useful 

PT for treating osteoporosis - 

XX 

PS Claim 8; Fig 3; 64pp; English. 
XX 

CC The invention relates to therapeutic agents that modulate the 
CC activity of calcitonin (CT) receptor. Modulators of CT receptor 
CC comprise a CT receptor modulating domain and a vehicle such as a 
CC polymer or an Fc domain, where the vehicle is covalently attached 
CC to the CT receptor modulating domain. The compositions comprising 
CC CT receptor modulating domains are used to treat osteoporosis. 
CC The present sequence is human immunoglobulin G (IgG1) Fc protein



CC used in the invention. 
XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 7 
AAU73018 

ID AAU73018 standard; Protein; 228 AA. 
XX 

AC AAU73018;
XX 

DT 12-MAR-2002 (first entry) 
XX 

DE Human immunoglobulin G (IgG) Fc region. 
XX 

KW Human; parathyroid hormone; PTH; parathyroid hormone-related protein; 

KW PTHrP; bone resorption inhibitor; osteoprotegrin; OPG; OPG-L antibody; 

KW calcitonin; bisphosphonate; oestrogen; oestrogen receptor; tibolone; 

KW osteopenia; hyperthyroidism; hypercalcaemia; tumour metastasis; bone; 

KW breast cancer; prostate cancer; cachexia; anorexia; osteoporosis; 

KW Paget's disease; osteomyelitis; osteonecrosis; bone cell death;

KW Gaucher's disease; sickle cell anaemia; systemic lupus erythematosus;

KW rheumatoid arthritis; periodontal disease; alopecia; fracture repair; 

KW immunoglobulin G; IgG. 
XX 

OS Homo sapiens. 
XX 

PN WO200181415-A2. 
XX 

PD 01-NOV-2001. 
XX 

PF 27-APR-2001; 2001WO-US13528.
XX

PR 27-APR-2000; 2000US-200053P.

PR 28-JUN-2000; 2000US-214860P.

PR 06-FEB-2001; 2001US-266673P.

PR 26-APR-2001; 2001US-0843221.



XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Kostenuik P, Liu C, Lacey DL; 
XX 

DR WPI; 2002-066435/09. 

DR N-PSDB; AAS97392. 
XX 

PT Composition, useful for treating osteopenia, comprises parathyroid 

PT hormone and parathyroid hormone-related protein receptor modulators - 
XX 

PS Claim 6; Figure 3; 107pp; English.
XX 

CC The invention relates to a composition (I) comprising modulators of 

CC parathyroid hormone (PTH) and parathyroid hormone-related protein (PTHrP)

CC which comprise a PTH/PTHrP modulating domain and a vehicle. (I)

CC comprising PTH agonist optionally with a bone resorption inhibitor, such

CC as osteoprotegrin (OPG), OPG-L antibody, calcitonin, bisphosphonates,

CC oestrogens, oestrogen receptor modulators and tibolone is useful for

CC treating osteopenia. (I) is useful for therapeutic and prophylactic

CC purposes. Antagonists of PTH receptor are useful in treating primary and

CC secondary hyperthyroidism, hypercalcaemia, tumour metastases,

CC particularly breast and prostate cancer, cachexia and anorexia,

CC osteopenia, including various forms of osteoporosis, Paget's disease of

CC bone, osteomyelitis, osteonecrosis or bone cell death, associated with

CC traumatic injury or nontraumatic necrosis associated with Gaucher's

CC disease, sickle cell anaemia, systemic lupus erythematosus, rheumatoid

CC arthritis, periodontal disease and alopecia. PTH receptor agonists are

CC useful as therapeutic agents in conditions including fracture repair

CC (including healing of non-union fractures), osteopenia, including various

CC forms of osteoporosis. AAU73018-AAU73181 represent parathyroid hormone

CC and parathyroid hormone related protein (PTH/PTHrP) modulators and

CC related amino acid sequences of the invention. 

XX 

SQ Sequence 228 AA; 

Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 8 
AAG66012 

ID AAG66012 standard; Protein; 228 AA. 
XX 

AC AAG66012; 
XX 

DT 27-FEB-2002 (first entry) 
XX 

DE Human immunoglobulin (Ig) G1 Fc region sequence.
XX 

KW Apo-AI; amphipathic; pharmaceutical; peptide mimic; antilipemic; 

KW anti-HIV; virucide; immunoglobulin; IgG1.

XX 

OS Homo sapiens. 
XX 

PN WO200181376-A2. 
XX 

PD 01-NOV-2001. 
XX 

PF 23-APR-2001; 2001WO-US13068.
XX

PR 21-APR-2000; 2000US-198920P.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Kohno T; 
XX 

DR WPI; 2002-049262/06. 

DR N-PSDB; AAI67658. 
XX 

PT Recombinant or modified therapeutic agents having Apo-AI amphipathic 

PT helix peptide activity useful in treatment of hypercholesterolemia and 

PT viral infections such as herpes simplex virus, human immunodeficiency 

PT virus 
XX 

PS Claim 8; Fig 3A-B; 49pp; English.
XX 

CC The invention provides a composition comprising a therapeutic agent that 

CC has activity similar to Apo-AI amphipathic helix peptide, but with better 

CC pharmaceutical characteristics attached to a vehicle through the 

CC peptide's N-terminus or C-terminus having a specified formula. The 

CC peptide mimic has greater half-life compared to conventional Apo-AI 

CC amphipathic helix peptide. The compositions are useful for treating 

CC hypercholesterolemia and viral infection such as HIV, HSV. The present 

CC sequence represents the human immunoglobulin (Ig) G1 Fc region which acts

CC as a vehicle. 

XX 

SQ Sequence 228 AA; 

Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 9 
ABB04279 

ID ABB04279 standard; Protein; 228 AA. 
XX 

AC ABB04279; 
XX 

DT 13-FEB-2002 (first entry) 
XX 

DE Human IgG1 Fc domain.
XX 

KW Glucagon antagonist; antidiabetic; anti-hormonal; Fc domain;

KW non-insulin dependent diabetes mellitus; human; immunoglobulin G; IgG. 

XX 

OS Homo sapiens. 
XX 

PN WO200183527-A2. 
XX 

PD 08-NOV-2001. 
XX 

PF 03-MAY-2001; 2001WO-US14321.
XX

PR 03-MAY-2000; 2000US-201436P.
PR 02-MAY-2001; 2001US-0847249.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Marshall WS, Stark KL; 
XX 

DR WPI; 2002-017738/02. 
DR N-PSDB; ABA03672.
XX 

PT Compositions comprising glucagon antagonist domains, useful for 

PT treating diabetes mellitus - 

XX 

PS Claim 8; Fig 2; 54pp; English. 
XX 

CC The invention relates to compositions comprising a glucagon antagonist 
CC domain and a vehicle, such as a polymer (e.g. PEG or dextran) or, 
CC preferably, an Fc domain. The vehicle is covalently attached to the 
CC glucagon antagonist domain. The compositions are administered to 
CC treat non-insulin dependent diabetes mellitus. The present sequence 
CC is the human IgG Fc domain, which may be used as the vehicle 
CC in the compositions of the invention. 
XX 



SQ Sequence 228 AA; 

Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 10 
ABJ38267 

ID ABJ38267 standard; Protein; 228 AA. 
XX 

AC ABJ38267;
XX 

DT 12-JUN-2003 (first entry) 
XX 

DE Human IgG1 Fc protein SEQ ID No 2.
XX 

KW TALL-1-binding protein; TALL-1; B-cell-mediated autoimmune disease;

KW systemic lupus erythematosus; B-cell-mediated cancer; lymphoma;

KW inflammation; rheumatoid arthritis; acute pancreatitis; atherosclerosis; 

KW Alzheimer's disease; asthma; cachexia; cirrhosis; diabetes; osteoporosis; 

KW glomerulonephritis; Hashimoto's thyroiditis; ischaemic injury; psoriasis; 

KW multiple myeloma; multiple sclerosis; Parkinson's disease; vasculitis; 

KW gene therapy; human IgG1Fc; human.

XX 

OS Homo sapiens.
XX 

PN WO200292620-A2.
XX 

PD 21-NOV-2002. 
XX 

PF 13-MAY-2002; 2002WO-US15273.
XX

PR 11-MAY-2001; 2001US-290196P.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Min H, Hsu H; 
XX 

DR WPI; 2003-156719/15. 

DR N-PSDB; ABT33856. 



XX 

PT New TALL-1-binding polypeptide, useful for modulating the activity of

PT TALL-1 and in treating, preventing or diagnosing a B-cell-mediated

PT autoimmune diseases, cancers or lymphomas - 
XX 

PS Claim 36; Fig 3; 236pp; English. 
XX 

CC The invention relates to a novel TALL-1-binding polypeptide comprising a

CC defined sequence in the specification. The composition is useful in

CC modulating the activity of TALL-1, and in treating, preventing,

CC ameliorating, diagnosing or prognosing a B-cell-mediated autoimmune 

CC disease (e.g. systemic lupus erythematosus) or B-cell-mediated cancer or 

CC lymphoma. The composition may also be used in treating inflammations 

CC (e.g. rheumatoid arthritis), acute pancreatitis, Alzheimer's disease, 

CC asthma, atherosclerosis, cachexia, cirrhosis, diabetes, 

CC glomerulonephritis, Hashimoto's thyroiditis, ischaemic injury, multiple 

CC myeloma, multiple sclerosis, osteoporosis, Parkinson's disease, psoriasis 

CC and vasculitis. Disorders may be treated with the novel composition using 

CC gene therapy. This sequence represents a human IgG1Fc protein relating to

CC the TALL-1 sequence of the invention.

XX 

SQ Sequence 228 AA;



Query Match 100.0%; Score 1238; DB 24; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 11 
AAB17957 

ID AAB17957 standard; Protein; 243 AA. 
XX 

AC AAB17957; 
XX 

DT 31-OCT-2000 (first entry) 
XX 

DE Fc-MMP inhibitor fusion protein sequence SEQ ID NO: 1068. 
XX 

KW Modified peptide; therapeutic agent; fusion; Fc domain; cancer; 

KW autoimmune disease; cytostatic; antiasthmatic; thrombolytic; VEGF; 

KW immunosuppressive; EPO; TPO; CTLA4; mimetic; IL-1; TNF; antagonist;

KW MMP; inhibitor; erythropoietin; thrombopoietin; interleukin 1;

KW cytotoxic T cell lymphocyte antigen 4; tumour necrosis factor; 

KW vascular endothelial growth factor; matrix metalloproteinase; 

KW asthma; thrombosis; pharmaceutical. 
XX 

OS Synthetic. 
XX 

PN WO200024782-A2.
XX

PD 04-MAY-2000.
XX

PF 25-OCT-1999; 99WO-US25044.
XX

PR 23-OCT-1998; 98US-0105371.

PR 22-OCT-1999; 99US-0428082.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham J, Boone TC; 
XX 

DR WPI; 2000-350702/30. 

DR N-PSDB; AAA69507. 
XX 

PT Novel composition of matter comprising an Fc domain and 

PT pharmacologically active peptides, useful for treating cancer and 

PT autoimmune diseases - 

XX 

PS Example 7; Page 585-586; 608pp; English.
XX 

CC The present invention describes composition of matter (I) comprising an 

CC Fc domain, pharmacologically active peptides, and linkers. Where (I) is: 

CC (X1)a-F1-(X2)b, where: F1 = an Fc domain; X1 and X2 = are each

CC independently selected from -(L1)c-P1, -(L1)c-P1-(L2)d-P2,

CC -(L1)c-P1-(L2)d-P2-(L3)e-P3, or -(L1)c-P1-(L2)d-P2-(L3)e-P3-(L4)f-P4

CC where P1, P2, P3, and P4 = are each independently sequences of

CC pharmacologically active peptides; L1, L2, L3, and L4 = are each

CC independently linkers; and a, b, c, d, e, and f = are each independently 

CC 0 or 1, provided that at least 1 of a and b is 1. The composition can 

CC have cytostatic, antiasthmatic, thrombolytic and immunosuppressive 

CC activities. DNAs, vectors and host cells from the present invention can 

CC be used for producing pharmaceutical compositions. The compositions are 

CC useful for treating cancer, asthma, thrombosis, or autoimmune diseases. 

CC The use of an Fc domain (rather than a Fab domain) can provide a longer 

CC half-life or incorporate functions such as Fc receptor binding, protein 

CC A binding, complement fixation, and possibly placental transfer. AAA69443 

CC to AAA69526 and AAB16955 to AAB18003 represent nucleotide and amino acid 

CC sequences used in the exemplification of the present invention. 

XX 

SQ Sequence 243 AA; 



Query Match 100.0%; Score 1238; DB 21; Length 243; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 12 
ABB73425 

ID ABB73425 standard; Protein; 243 AA. 
XX 

AC ABB73425; 
XX 

DT 05-APR-2002 (first entry) 
XX 

DE Fc-MMP inhibitor fusion nucleic acid SEQ ID NO: 1067. 
XX 

KW Modified peptide; mimetic; Fc domain; fusion; immunoglobulin G; IgG; 

KW EPO; erythropoietin; TPO; tumour necrosis factor alpha inhibitor; 

KW TNF-alpha inhibitor; interleukin 1 antagonist; IL-1 antagonist; TMP; 

KW TPO mimetic peptide; EPO mimetic peptide; EMP; VEGF antagonist; 

KW MMP inhibitor; antiinflammatory; antitumour; immunosuppressive; 

KW cytostatic; antirheumatic; antiarthritic; antidiabetic; ophthalmological;

KW antianaemic; anorectic; antiinfertility; haemostatic; dermatological;

KW neuroprotective; inflammatory disease; autoimmune disease; tumour growth; 

KW cancer; rheumatoid arthritis; diabetic retinopathy; infertility; obesity; 

KW sleep disorder; neurological degenerative disease; anaemia; 

KW thrombocytopaenia; metastatic tumour; systemic lupus erythematosus; 

KW Fanconi's syndrome. 

XX 

OS Homo sapiens.

OS Synthetic. 
XX 

PN WO200183525-A2. 
XX 

PD 08-NOV-2001. 
XX 

PF 02-MAY-2001; 2001WO-US14310.
XX

PR 03-MAY-2000; 2000US-0563286.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham JC, Boone TC, Gudas JM; 
XX 

DR WPI; 2002-130313/17. 

DR N-PSDB; ABL35775. 
XX 

PT Novel vehicle-peptide molecule or its multimers useful for treating 

PT inflammatory and autoimmune diseases, cancer, rheumatoid arthritis, 



PT diabetic retinopathy, obesity, sleep disorders and infertility - 
XX 

PS Example 7; Fig 25A-B; 176pp; English. 
XX 

CC The present invention describes a vehicle-peptide molecule (I) or its 

CC multimers. (I) can have antiinflammatory, antitumour, immunosuppressive, 

CC cytostatic, antirheumatic, antiarthritic, antidiabetic, ophthalmological,

CC antianaemic, anorectic, antiinfertility, haemostatic, dermatological and

CC neuroprotective activities. (I) can be used as a therapeutic or 

CC prophylactic agent as well as for screening purposes. (I) is useful for 

CC diagnosing diseases characterised by dysfunction of their associated 

CC protein of interest, for identifying normal or abnormal proteins of 

CC interest, as a part of diagnostic kit to detect the presence of their 

CC proteins of interest in a biological sample. Additionally, (I) is useful 

CC for treating inflammatory and autoimmune diseases, tumour growth, cancer, 

CC rheumatoid arthritis, diabetic retinopathy, obesity, sleep disorders, 

CC infertility, and neurological degenerative diseases. (I), comprising 

CC EPO-mimetic compounds are useful for treating disorders characterised by 

CC low red blood cell levels such as anaemia. The TPO-mimetic comprising 

CC compounds are useful for treating conditions that involve an existing 

CC megakaryocyte/platelet deficiency or an expected megakaryocyte/platelet 

CC deficiency, such as thrombocytopaenia, aplastic anaemia, metastatic

CC tumour which result in thrombocytopaenia, systemic lupus erythematosus,

CC and Fanconi's syndrome. ABB72403 to ABB73426 and ABL35695 to ABL35777

CC represent amino acid and nucleic acid sequences used in the 

CC exemplification of the present invention. 

XX 

SQ Sequence 243 AA; 



Query Match 100.0%; Score 1238; DB 23; Length 243; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 13 
AAB16958 

ID AAB16958 standard; Protein; 247 AA. 
XX 

AC AAB16958; 
XX 

DT 31-OCT-2000 (first entry) 



XX 

DE Fc-TMP protein sequence SEQ ID NO: 6. 
XX 

KW Modified peptide; therapeutic agent; fusion; Fc domain; cancer; 

KW autoimmune disease; cytostatic; antiasthmatic; thrombolytic; VEGF; 

KW immunosuppressive; EPO; TPO; CTLA4; mimetic; IL-1; TNF; antagonist;

KW MMP; inhibitor; erythropoietin; thrombopoietin; interleukin 1; 

KW cytotoxic T cell lymphocyte antigen 4; tumour necrosis factor; 

KW vascular endothelial growth factor; matrix metalloproteinase;

KW asthma; thrombosis; pharmaceutical. 
XX 

OS Homo sapiens. 

OS Synthetic. 
XX 

PN WO200024782-A2. 
XX 

PD 04-MAY-2000. 
XX 

PF 25-OCT-1999; 99WO-US25044.
XX

PR 23-OCT-1998; 98US-0105371.

PR 22-OCT-1999; 99US-0428082.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham J, Boone TC; 
XX 

DR WPI; 2000-350702/30. 

DR N-PSDB; AAA69444.
XX 

PT Novel composition of matter comprising an Fc domain and 

PT pharmacologically active peptides, useful for treating cancer and 

PT autoimmune diseases - 

XX 

PS Claim 21; Page 179-180; 608pp; English.
XX 

CC The present invention describes composition of matter (I) comprising an 

CC Fc domain, pharmacologically active peptides, and linkers. Where (I) is: 

CC (X1)a-F1-(X2)b, where: F1 = an Fc domain; X1 and X2 = are each

CC independently selected from -(L1)c-P1, -(L1)c-P1-(L2)d-P2,

CC -(L1)c-P1-(L2)d-P2-(L3)e-P3, or -(L1)c-P1-(L2)d-P2-(L3)e-P3-(L4)f-P4

CC where P1, P2, P3, and P4 = are each independently sequences of

CC pharmacologically active peptides; L1, L2, L3, and L4 = are each

CC independently linkers; and a, b, c, d, e, and f = are each independently 

CC 0 or 1, provided that at least 1 of a and b is 1. The composition can 

CC have cytostatic, antiasthmatic, thrombolytic and immunosuppressive 

CC activities. DNAs, vectors and host cells from the present invention can 

CC be used for producing pharmaceutical compositions. The compositions are 

CC useful for treating cancer, asthma, thrombosis, or autoimmune diseases. 

CC The use of an Fc domain (rather than a Fab domain) can provide a longer 

CC half-life or incorporate functions such as Fc receptor binding, protein 

CC A binding, complement fixation, and possibly placental transfer. AAA69443 

CC to AAA69526 and AAB16955 to AAB18003 represent nucleotide and amino acid 

CC sequences used in the exemplification of the present invention. 

XX 

SQ Sequence 247 AA; 



Query Match 100.0%; Score 1238; DB 21; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 14 
ABB73411 

ID ABB73411 standard; Protein; 247 AA. 
XX 

AC ABB73411; 
XX 

DT 05-APR-2002 (first entry) 
XX 

DE Fc-TPO mimetic peptide (Fc-TMP) amino acid SEQ ID NO: 6. 
XX 

KW Modified peptide; mimetic; Fc domain; fusion; immunoglobulin G; IgG; 

KW EPO; erythropoietin; TPO; tumour necrosis factor alpha inhibitor; 

KW TNF-alpha inhibitor; interleukin 1 antagonist; IL-1 antagonist; TMP; 

KW TPO mimetic peptide; EPO mimetic peptide; EMP; VEGF antagonist; 

KW MMP inhibitor; antiinflammatory; antitumour; immunosuppressive; 

KW cytostatic; antirheumatic; antiarthritic; antidiabetic; ophthalmological;

KW antianaemic; anorectic; antiinfertility; haemostatic; dermatological;

KW neuroprotective; inflammatory disease; autoimmune disease; tumour growth; 

KW cancer; rheumatoid arthritis; diabetic retinopathy; infertility; obesity; 

KW sleep disorder; neurological degenerative disease; anaemia; 

KW thrombocytopaenia; metastatic tumour; systemic lupus erythematosus; 

KW Fanconi's syndrome.

XX 

OS Homo sapiens. 

OS Synthetic. 
XX 

PN WO200183525-A2.
XX

PD 08-NOV-2001.
XX

PF 02-MAY-2001; 2001WO-US14310.
XX

PR 03-MAY-2000; 2000US-0563286.
XX 

PA (AMGE-) AMGEN INC. 
XX 



PI Feige U, Liu C, Cheetham JC, Boone TC, Gudas JM;
XX 

DR WPI; 2002-130313/17. 

DR N-PSDB; ABL35761. 
XX 

PT Novel vehicle-peptide molecule or its multimers useful for treating 

PT inflammatory and autoimmune diseases, cancer, rheumatoid arthritis, 

PT diabetic retinopathy, obesity, sleep disorders and infertility - 
XX 

PS Claim 21; Fig 7; 176pp; English. 
XX 

CC The present invention describes a vehicle-peptide molecule (I) or its 

CC multimers. (I) can have antiinflammatory, antitumour, immunosuppressive, 

CC cytostatic, antirheumatic, antiarthritic, antidiabetic, ophthalmological,

CC antianaemic, anorectic, antiinfertility, haemostatic, dermatological and

CC neuroprotective activities. (I) can be used as a therapeutic or 

CC prophylactic agent as well as for screening purposes. (I) is useful for 

CC diagnosing diseases characterised by dysfunction of their associated 

CC protein of interest, for identifying normal or abnormal proteins of 

CC interest, as a part of diagnostic kit to detect the presence of their 

CC proteins of interest in a biological sample. Additionally, (I) is useful 

CC for treating inflammatory and autoimmune diseases, tumour growth, cancer, 

CC rheumatoid arthritis, diabetic retinopathy, obesity, sleep disorders, 

CC infertility, and neurological degenerative diseases. (I), comprising 

CC EPO-mimetic compounds are useful for treating disorders characterised by 

CC low red blood cell levels such as anaemia. The TPO-mimetic comprising 

CC compounds are useful for treating conditions that involve an existing 

CC megakaryocyte/platelet deficiency or an expected megakaryocyte/platelet 

CC deficiency, such as thrombocytopaenia, aplastic anaemia, metastatic

CC tumour which result in thrombocytopaenia, systemic lupus erythematosus, 

CC and Fanconi's syndrome. ABB72403 to ABB73426 and ABL35695 to ABL35777 

CC represent amino acid and nucleic acid sequences used in the 

CC exemplification of the present invention. 

XX 

SQ Sequence 247 AA;



Query Match 100.0%; Score 1238; DB 23; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228





RESULT 15 
AAB17951 

ID AAB17951 standard; Protein; 248 AA. 
XX 

AC AAB17951; 
XX 

DT 31-OCT-2000 (first entry) 
XX 

DE Fc-TNF-alpha inhibitor fusion protein sequence SEQ ID NO: 1056.
XX 

KW Modified peptide; therapeutic agent; fusion; Fc domain; cancer; 

KW autoimmune disease; cytostatic; antiasthmatic; thrombolytic; VEGF; 

KW immunosuppressive; EPO; TPO; CTLA4; mimetic; IL-1; TNF; antagonist;

KW MMP; inhibitor; erythropoietin; thrombopoietin; interleukin 1; 

KW cytotoxic T cell lymphocyte antigen 4; tumour necrosis factor; 

KW vascular endothelial growth factor; matrix metalloproteinase; 

KW asthma; thrombosis; pharmaceutical. 
XX 

OS Synthetic. 
XX 

PN WO200024782-A2. 
XX 

PD 04-MAY-2000. 
XX 

PF 25-OCT-1999; 99WO-US25044.
XX

PR 23-OCT-1998; 98US-0105371.

PR 22-OCT-1999; 99US-0428082.
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham J, Boone TC; 
XX 

DR WPI; 2000-350702/30. 

DR N-PSDB; AAA69501. 
XX 

PT Novel composition of matter comprising an Fc domain and 

PT pharmacologically active peptides, useful for treating cancer and 

PT autoimmune diseases - 

XX 

PS Example 4; Page 568-569; 608pp; English.
XX 

CC The present invention describes composition of matter (I) comprising an 

CC Fc domain, pharmacologically active peptides, and linkers. Where (I) is: 

CC (X1)a-F1-(X2)b, where: F1 = an Fc domain; X1 and X2 = are each

CC independently selected from -(L1)c-P1, -(L1)c-P1-(L2)d-P2,

CC -(L1)c-P1-(L2)d-P2-(L3)e-P3, or -(L1)c-P1-(L2)d-P2-(L3)e-P3-(L4)f-P4

CC where P1, P2, P3, and P4 = are each independently sequences of

CC pharmacologically active peptides; L1, L2, L3, and L4 = are each

CC independently linkers; and a, b, c, d, e, and f = are each independently

CC 0 or 1, provided that at least 1 of a and b is 1. The composition can 

CC have cytostatic, antiasthmatic, thrombolytic and immunosuppressive 

CC activities. DNAs, vectors and host cells from the present invention can 

CC be used for producing pharmaceutical compositions. The compositions are 

CC useful for treating cancer, asthma, thrombosis, or autoimmune diseases. 

CC The use of an Fc domain (rather than a Fab domain) can provide a longer 

CC half-life or incorporate functions such as Fc receptor binding, protein 



CC A binding, complement fixation, and possibly placental transfer. AAA69443 

CC to AAA69526 and AAB16955 to AAB18003 represent nucleotide and amino acid 

CC sequences used in the exemplification of the present invention. 
XX 

SQ Sequence 248 AA; 

Query Match 100.0%; Score 1238; DB 21; Length 248; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



Search completed: January 16, 2004, 09:37:55 
Job time : 43 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on: January 16, 2004, 09:36:27 ; Search time 21 Seconds
(without alignments)
459.375 Million cell updates/sec

Title: US-09-843-221A-2
Perfect score: 1238
Sequence: 1 MDKTHTCPPCPAPELLGGPS MHEALHNHYTQKSLSLSPGK 228

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 328717 seqs, 42310858 residues

Total number of hits satisfying chosen parameters: 328717

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Issued_Patents_AA:*

1: /cgn2_6/ptodata/1/iaa/5A_COMB.pep:*

2: /cgn2_6/ptodata/1/iaa/5B_COMB.pep:*

3: /cgn2_6/ptodata/1/iaa/6A_COMB.pep:*

4: /cgn2_6/ptodata/1/iaa/6B_COMB.pep:*

5: /cgn2_6/ptodata/1/iaa/PCTUS_COMB.pep:*

6: /cgn2_6/ptodata/1/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

Result          Query
   No.  Score   Match  Length  DB  ID                   Description
     1   1234    99.7     229   4  US-09-122-144-2      Sequence 2, Appli
     2   1234    99.7     347   1  US-07-940-861-43     Sequence 43, Appl
     3   1234    99.7     347   1  US-08-459-512-43     Sequence 43, Appl
     4   1234    99.7     347   2  US-08-459-657-43     Sequence 43, Appl
     5   1234    99.7     347   2  US-08-460-132-43     Sequence 43, Appl
     6   1234    99.7     347   3  US-08-466-465-8      Sequence 8, Appli
     7   1234    99.7     347   5  PCT-US92-02050-43    Sequence 43, Appl
     8   1234    99.7     446   4  US-09-157-452B-12    Sequence 12, Appl
     9   1234    99.7     482   4  US-09-189-129-2      Sequence 2, Appli
    10   1233    99.6     232   2  US-08-595-043A-50    Sequence 50, Appl
    11   1233    99.6     235   3  US-09-131-247-6      Sequence 6, Appli
    12   1233    99.6     331   ?  [illegible]          [illegible]
    13   1233    99.6       ?   4  [illegible]          [illegible]
    14   1233    99.6     360   4  US-09-180-100-11     Sequence 11, Appl
    15   1233    99.6       ?   ?  [illegible]          [illegible]
    16   1233    99.6     371   ?  [illegible]          [illegible]
    17   1233    99.6     376   4  [illegible]          [illegible]
    18   1233    99.6     387   ?  [illegible]          [illegible]
    19   1233    99.6     388   3  US-09-171-947-16     Sequence 16, Appl
    20   1233    99.6     389   3  US-09-171-947-14     Sequence 14, Appl
    21   1233    99.6     396   2  [illegible]          [illegible]
    22   1233    99.6     396   3  US-09-176-228-7      Sequence 7, Appli
    23   1233    99.6     424   4  US-09-777-597A-8     Sequence 8, Appli
    24   1233    99.6     424   5  PCT-US95-07866-12    Sequence 12, Appl
    25   1233    99.6     424   5  PCT-US95-07866-14    Sequence 14, Appl
    26   1233    99.6     437   5  PCT-US96-10047-11    Sequence 11, Appl
    27   1233    99.6     442   ?  PCT-US96-10047-9     Sequence

ALIGNMENTS



RESULT 1 
US-09-122-144-2 

; Sequence 2, Application US/09122144A 

; Patent No. 6485726 

; GENERAL INFORMATION: 

; APPLICANT: Blumberg, Richard S. 

; APPLICANT: Simister, Neil E. 

; APPLICANT: Lencer, Wayne I. 

; TITLE OF INVENTION: RECEPTOR SPECIFIC TRANSEPITHELIAL TRANSPORT OF THERAPEUTICS

; FILE REFERENCE: S1383/7003 

; CURRENT APPLICATION NUMBER: US/09/122,144A

; CURRENT FILING DATE: 1998-07-24 

; EARLIER APPLICATION NUMBER: US 09/122,144 

; EARLIER FILING DATE: 1998-07-24 

; NUMBER OF SEQ ID NOS: 22

; SEQ ID NO 2

; LENGTH: 229

; TYPE: PRT

; ORGANISM: Homo sapiens
US-09-122-144-2 



Query Match 99.7%; Score 1234; DB 4; Length 229; 

Best Local Similarity 99.6%; Pred. No. 3.5e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 61

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  62 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 121

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 122 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 181

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 182 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 229



RESULT 2 

US-07-940-861-43 

Sequence 43, Application US/07940861 
Patent No. 5547853 
GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 
APPLICANT: WALLNER, Barbara P. 
APPLICANT: MILLER, Glenn T. 
APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2-BINDING DOMAIN OF LYMPHOCYTE
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: Fish & Neave
STREET: 875 Third Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S.A. 
ZIP: 10022-6250 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/940,861
FILING DATE: 21-OCT-1992
CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02050
FILING DATE: 12-MAR-1992 
PRIOR APPLICATION DATA: 



APPLICATION NUMBER: US 07/667,971 

FILING DATE: 12-MAR-1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 

FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION: 

NAME: HALEY, James F., Jr.

REGISTRATION NUMBER: 27,794 

REFERENCE/DOCKET NUMBER: B151CIP2 
TELECOMMUNICATION INFORMATION:

TELEPHONE: (212)715-0600 

TELEFAX: (212)715-0673 

TELEX: 14-8367 
INFORMATION FOR SEQ ID NO: 43:
SEQUENCE CHARACTERISTICS: 

LENGTH: 347 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-07-940-861-43 



Query Match 99.7%; Score 1234; DB 1; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 180 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 239

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 240 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 300 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347



RESULT 3 

US-08-459-512-43 

Sequence 43, Application US/08459512 
Patent No. 5728677 
GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 
APPLICANT: WALLNER, Barbara P. 
APPLICANT: MILLER, Glenn T. 
APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2-BINDING DOMAIN OF LYMPHOCYTE
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS:
ADDRESSEE: Fish & Neave 
STREET: 875 Third Avenue 



CITY: New York 

STATE: New York 

COUNTRY: U.S.A. 

ZIP: 10022-6250
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/459,512

FILING DATE: 02-JUN-1995 

CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02050

FILING DATE: 12-MAR-1992 

APPLICATION NUMBER: US 07/667,971 

FILING DATE: 12-MAR-1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 

FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION: 

NAME: HALEY, James F., Jr. 

REGISTRATION NUMBER: 27,794 

REFERENCE/DOCKET NUMBER: B151CIP2 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (212)715-0600 

TELEFAX: (212)715-0673 

TELEX: 14-8367 
INFORMATION FOR SEQ ID NO: 43: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 347 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-459-512-43 



Query Match 99.7%; Score 1234; DB 1; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 180 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 239

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 240 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 300 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347



RESULT 4 

US-08-459-657-43 

Sequence 43, Application US/08459657 
Patent No. 5914111 
GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 
APPLICANT: WALLNER, Barbara P.
APPLICANT: MILLER, Glenn T. 
APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2-BINDING DOMAIN OF LYMPHOCYTE
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS:
ADDRESSEE: Fish & Neave 
STREET: 875 Third Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S.A. 
ZIP: 10022-6250 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/459,657
FILING DATE: 02-JUN-1995
CLASSIFICATION: 424
PRIOR APPLICATION DATA:

APPLICATION NUMBER: PCT/US92/02050
FILING DATE: 12-MAR-1992
APPLICATION NUMBER: US 07/667,971
FILING DATE: 12-MAR-1991
PRIOR APPLICATION DATA:

APPLICATION NUMBER: US 07/770,967
FILING DATE: 07-OCT-1991
ATTORNEY/AGENT INFORMATION: 
NAME: HALEY, James F., Jr. 
REGISTRATION NUMBER: 27,794 
REFERENCE/DOCKET NUMBER: B151CIP2 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212)715-0600
TELEFAX: (212)715-0673
TELEX: 14-8367 
INFORMATION FOR SEQ ID NO: 43: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 347 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-459-657-43 



Query Match 99.7%; Score 1234; DB 2; Length 347;
Best Local Similarity 99.6%; Pred. No. 6.3e-117;
Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 180 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 239

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 240 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 300 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347

RESULT 5 

US-08-460-132-43 

Sequence 43, Application US/08460132 
Patent No. 5928643 
GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 
APPLICANT: WALLNER, Barbara P.
APPLICANT: MILLER, Glenn T.
APPLICANT: ROSA, Margaret D.

TITLE OF INVENTION: CD2-BINDING DOMAIN OF LYMPHOCYTE
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3
NUMBER OF SEQUENCES: 43
CORRESPONDENCE ADDRESS:
ADDRESSEE: Fish & Neave 
STREET: 875 Third Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S.A. 
ZIP: 10022-6250 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/460,132
FILING DATE: 02-JUN-1995
CLASSIFICATION: 424
PRIOR APPLICATION DATA:

APPLICATION NUMBER: US 07/940,861
FILING DATE: 21-OCT-1992
APPLICATION NUMBER: PCT/US92/02050 
FILING DATE: 12-MAR-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/667,971 
FILING DATE: 12-MAR-1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 
FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION:
NAME: HALEY, James F., Jr.
REGISTRATION NUMBER: 27,794
REFERENCE/DOCKET NUMBER: B151CIP2
TELECOMMUNICATION INFORMATION:
TELEPHONE: (212) 715-0600
TELEFAX: (212) 715-0673
TELEX: 14-8367
INFORMATION FOR SEQ ID NO: 43:
SEQUENCE CHARACTERISTICS:
LENGTH: 347 amino acids
TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: protein
US-08-460-132-43 

Query Match 99.7%; Score 1234; DB 2; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 180 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 239

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 240 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 300 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347



RESULT 6 
US-08-466-465-8 

; Sequence 8, Application US/08466465 

; Patent No. 6162432 

; GENERAL INFORMATION: 

APPLICANT: Wallner, Barbara P. 

APPLICANT: Cooper, Kevin D. 

TITLE OF INVENTION: Method of Prophylaxis or Treatment of Antigen 
TITLE OF INVENTION: Presenting Cell Driven Skin Conditions Using 
TITLE OF INVENTION: Inhibitors of the CD2/LFA-3 Interaction 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD 

STREET: 60 State Street, Suite 510 

CITY: Boston 

STATE: Massachusetts 

COUNTRY: USA

ZIP: 02109-1875
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 



OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/466,465
FILING DATE:
PRIOR APPLICATION DATA:

APPLICATION NUMBER: PCT/US92/08755
FILING DATE: 06-OCT-1992
PRIOR APPLICATION DATA:

APPLICATION NUMBER: US 07/862,022
FILING DATE: 12-APR-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,969 
FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION: 
NAME: Myers, Louis (PLM) 
REGISTRATION NUMBER: 35,965 
REFERENCE/DOCKET NUMBER: BGP-111CP
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617) 227-7400 
TELEFAX: (617) 227-5941 
INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 347 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-466-465-8 

Query Match 99.7%; Score 1234; DB 3; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 180 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 239

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 240 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 300 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347



RESULT 7 

PCT-US92-02050-43 

; Sequence 43, Application PC/TUS9202050 
; GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 

APPLICANT: WALLNER, Barbara P. 

APPLICANT: MILLER, Glenn T. 



APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2-BINDING DOMAIN OF LYMPHOCYTE
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Neave 

STREET: 875 Third Avenue 

CITY: New York 

STATE: New York 

COUNTRY: U.S.A. 

ZIP: 10022-6250 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: PCT/US92/02050

FILING DATE: 19920312 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/667,971 

FILING DATE: 12-MAR-1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 

FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION: 

NAME: HALEY, James F., Jr. 

REGISTRATION NUMBER: 27,794 

REFERENCE/DOCKET NUMBER: B151CIP2 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (212)715-0600 

TELEFAX: (212)715-0673 

TELEX: 14-8367 
INFORMATION FOR SEQ ID NO: 43: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 347 amino acids

TYPE: AMINO ACID
TOPOLOGY: linear
MOLECULE TYPE: protein 
PCT-US92-02050-43 

Query Match 99.7%; Score 1234; DB 5; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
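Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 180 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 239

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 240 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 300 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347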



RESULT 8 

US-09-157-452B-12 

; Sequence 12, Application US/09157452B 

; Patent No. 6482409 

; GENERAL INFORMATION: 

; APPLICANT: Lobb, Roy R. 

; APPLICANT: Burkly, Linda C. 

; TITLE OF INVENTION: TREATMENT FOR INFLAMMATORY BOWEL DISEASE 
; FILE REFERENCE: 10274-004003 

; CURRENT APPLICATION NUMBER: US/09/157,452B
; CURRENT FILING DATE: 1998-09-21
; PRIOR APPLICATION NUMBER: US 08/950,660
; PRIOR FILING DATE: 1997-10-15
; PRIOR APPLICATION NUMBER: US 08/373,857
; PRIOR FILING DATE: 1995-01-18
; PRIOR APPLICATION NUMBER: US 08/284,603
; PRIOR FILING DATE: 1994-08-11
; PRIOR APPLICATION NUMBER: PCT/US93/00924
; PRIOR FILING DATE: 1993-02-02
; PRIOR APPLICATION NUMBER: US 07/835,139
; PRIOR FILING DATE: 1992-02-12
; NUMBER OF SEQ ID NOS: 16

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 12 

LENGTH: 446 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-157-452B-12 



Query Match 99.7%; Score 1234; DB 4; Length 446; 

Best Local Similarity 99.6%; Pred. No. 9.1e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 219 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 278

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 279 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 338

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 339 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 398

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 399 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 446



RESULT 9 



US-09-189-129-2 

Sequence 2, Application US/09189129 
Patent No. 6323027 
GENERAL INFORMATION: 

APPLICANT: Burkly, Linda C 
APPLICANT: Benjamin, Christopher D
APPLICANT: Hession, Catherine A
APPLICANT: Whitty, Adrian

TITLE OF INVENTION: COMMON GAMMA CHAIN BLOCKING AGENTS
NUMBER OF SEQUENCES: 17
CORRESPONDENCE ADDRESS:
ADDRESSEE: Biogen, Inc.
STREET: 14 Cambridge Center
CITY: Cambridge
STATE: Massachusetts
COUNTRY: USA
ZIP: 02142
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/189,129
FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: AO 06 PCT CIP 
FILING DATE: 09-MAY-1997 
APPLICATION NUMBER: 60/017,466 
FILING DATE: 10-MAY-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: Kaplan, Warren A. 
REGISTRATION NUMBER: 34,199 
REFERENCE/DOCKET NUMBER: AO 06 PCT CIP 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617 679-2000 
TELEFAX: 617 679-2838 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 482 amino acids
TYPE: amino acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE:
HYPOTHETICAL: YES
ANTI-SENSE: NO
US-09-189-129-2

Query Match 99.7%; Score 1234; DB 4; Length 482;

Best Local Similarity 99.6%; Pred. No. 1e-116;

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 255 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 314

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 315 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 374

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 375 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 434

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 435 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 482



RESULT 10 
US-08-595-043A-50 

; Sequence 50, Application US/08595043A 

; Patent No. 5935824 

; GENERAL INFORMATION: 

APPLICANT: SGARLATO, GREGORY D. 

TITLE OF INVENTION: PROTEIN EXPRESSION SYSTEM 
NUMBER OF SEQUENCES: 90 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: MEDLEN & CARROLL 

STREET: 220 MONTGOMERY STREET, SUITE 2200
CITY: SAN FRANCISCO
STATE: CALIFORNIA

COUNTRY: UNITED STATES OF AMERICA

ZIP: 94104
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/595,043A
FILING DATE: 31-JAN-1996

CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: CARROLL, PETER G. 

REGISTRATION NUMBER: 32,837 

REFERENCE/DOCKET NUMBER: SGAR-00371 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 705-8410 

TELEFAX: (415) 397-8338 
; INFORMATION FOR SEQ ID NO: 50: 
SEQUENCE CHARACTERISTICS:
LENGTH: 232 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-595-043A-50 

Query Match 99.6%; Score 1233; DB 2; Length 232; 

Best Local Similarity 100.0%; Pred. No. 4.5e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   6 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 65

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  66 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 125

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 126 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 185

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 186 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 232



RESULT 11 
US-09-131-247-6 

; Sequence 6, Application US/09131247 

; Patent No. 6294170 

; GENERAL INFORMATION: 

; APPLICANT: Boone, Thomas C. 

; APPLICANT: Hershenson, Susan 

; APPLICANT: Bevilacqua, Michael P. 

; APPLICANT: Collins, David S. 

; TITLE OF INVENTION: COMPOSITION AND METHOD FOR TREATING INFLAMMATORY 
; TITLE OF INVENTION: DISEASES 
FILE REFERENCE: A-365F 

CURRENT APPLICATION NUMBER: US/09/131,247 
CURRENT FILING DATE: 1998-08-07 
; EARLIER APPLICATION NUMBER: 60/055,185 
; EARLIER FILING DATE: 1997-08-08 
; EARLIER APPLICATION NUMBER: PCT/US97/02131
; EARLIER FILING DATE: 1997-02-10
; NUMBER OF SEQ ID NOS: 16

SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 6

LENGTH: 235
TYPE: PRT
ORGANISM: Human 
US-09-131-247-6 

Query Match 99.6%; Score 1233; DB 3; Length 235; 

Best Local Similarity 100.0%; Pred. No. 4.6e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   9 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 68

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  69 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 128

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 129 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 188

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 189 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 235



RESULT 12 
US-09-178-869-2 

; Sequence 2, Application US/09178869B 

; Patent No. 6197294 

; GENERAL INFORMATION: 

; APPLICANT: Tao, Weng 

; APPLICANT: Wong, Shou 

; APPLICANT: Hickey, William F 

; APPLICANT: Hammang, Joseph P. 

; APPLICANT: Baetge, E. Edward 

; TITLE OF INVENTION: CELL SURFACE-INDUCED MACROPHAGE ACTIVATION
; FILE REFERENCE: 17810-043

; CURRENT APPLICATION NUMBER: US/09/178,869B
; CURRENT FILING DATE: 1998-10-26
; NUMBER OF SEQ ID NOS: 14

SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 2

LENGTH: 331

TYPE: PRT

ORGANISM: Homo sapiens 
US-09-178-869-2 



Query Match 99.6%; Score 1233; DB 3; Length 331; 

Best Local Similarity 100.0%; Pred. No. 7.5e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 105 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 164

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 165 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 224

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 225 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 284

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 285 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 331





RESULT 13 
US-09-761-413-2 

; Sequence 2, Application US/09761413 

; Patent No. 6506891 

; GENERAL INFORMATION: 

; APPLICANT: Tao, Weng 

; APPLICANT: Wong, Shou 

; APPLICANT: Hickey, William F 

; APPLICANT: Hammang, Joseph P. 



; APPLICANT: Baetge, E. Edward

; TITLE OF INVENTION: CELL SURFACE-INDUCED MACROPHAGE ACTIVATION
; FILE REFERENCE: 17810-043

; CURRENT APPLICATION NUMBER: US/09/761,413

; CURRENT FILING DATE: 2001-01-16

; PRIOR APPLICATION NUMBER: US/09/178,869

; PRIOR FILING DATE: 1998-10-26

; NUMBER OF SEQ ID NOS: 14

; SOFTWARE: PatentIn Ver. 2.0

; SEQ ID NO 2

LENGTH: 331

TYPE: PRT

ORGANISM: Homo sapiens 
US-09-761-413-2 



Query Match 99.6%; Score 1233; DB 4; Length 331; 

Best Local Similarity 100.0%; Pred. No. 7.5e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 105 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 164

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 165 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 224

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 225 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 284

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 285 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 331




RESULT 14 








US-09-180-100-11

; Sequence 11, Application US/09180100
; Patent No. 6306395
; GENERAL INFORMATION:

; APPLICANT: NAKAMURA, Norio
; APPLICANT: NAGATA, Shigekazu

; TITLE OF INVENTION: NOVEL Fas ANTIGEN DERIVATIVE
; FILE REFERENCE: 1110-207P

CURRENT APPLICATION NUMBER: US/09/180,100
; CURRENT FILING DATE: 1998-11-02
; EARLIER APPLICATION NUMBER: PCT/JP97/01502

EARLIER FILING DATE: 1997-05-01
; NUMBER OF SEQ ID NOS: 25
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 11
LENGTH: 360
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-180-100-11 



Query Match 99.6%; Score 1233; DB 4; Length 360; 

Best Local Similarity 100.0%; Pred. No. 8.4e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 134 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 193

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 194 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 253

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 254 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 313

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 314 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 360



RESULT 15 
US-08-236-311-7 

Sequence 7, Application US/08236311 
Patent No. 5565335 
GENERAL INFORMATION: 

APPLICANT: Capon, Daniel J. 
APPLICANT: Gregory, Timothy J. 
TITLE OF INVENTION: Adheson Variants 
NUMBER OF SEQUENCES: 25 
CORRESPONDENCE ADDRESS:

ADDRESSEE: Genentech, Inc.
STREET: 460 Point San Bruno Blvd
CITY: South San Francisco
STATE: California
COUNTRY: USA
ZIP: 94080
COMPUTER READABLE FORM:

MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk
COMPUTER: IBM PC compatible
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: patin (Genentech)
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/236,311
FILING DATE: 02-MAY-1994
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/936190 
FILING DATE: 26-AUG-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/842777 
FILING DATE: 18-FEB-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/250785 
FILING DATE: 28-SEP-1988 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/104329 



FILING DATE: 02-OCT-1987 
ATTORNEY/AGENT INFORMATION: 

NAME: Hasak, Janet E. 

REGISTRATION NUMBER: 28,616 

REFERENCE/DOCKET NUMBER: 444P1C2 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415/225-1896 

TELEFAX: 415/952-9881 

TELEX: 910/371-7168 
; INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 371 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
US-08-236-311-7 



Query Match 99.6%; Score 1233; DB 1; Length 371; 

Best Local Similarity 100.0%; Pred. No. 8.8e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 145 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 204

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 205 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 264

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 265 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 324

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 325 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 371




Search completed: January 16, 2004, 09:39:18
Job time : 22 secs





GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 16, 2004, 09:36:27 ; Search time 21 Seconds
(without alignments)
1044.117 Million cell updates/sec

Title: US-09-843-221A-2
Perfect score: 1238
Sequence: 1 MDKTHTCPPCPAPELLGGPS MHEALHNHYTQKSLSLSPGK 228

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 283308 seqs, 96168682 residues

Total number of hits satisfying chosen parameters: 283308

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database : PIR_76:*
pir1:*
pir2:*
pir3:*
pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
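
The percentage columns reported for each hit can be cross-checked against the raw numbers. The sketch below is a minimal illustration, not GenCore's own code: it assumes that Query Match is the hit score divided by the perfect score (1238 for this query) and that Best Local Similarity is the number of identical columns divided by all aligned columns. Both function names are hypothetical. Pred. No. is not reproduced here because it depends on the full score distribution.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the perfect (self-alignment) score.

    Assumed definition, chosen because it agrees with the values in this report.
    """
    return round(100.0 * score / perfect_score, 1)


def local_similarity_percent(matches: int, conservative: int,
                             mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all aligned columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    perfect = 1238  # "Perfect score" from the run header
    for score in (1234, 1233, 1227):
        print(score, query_match_percent(score, perfect))
    print(local_similarity_percent(227, 1, 0, 0))
    print(local_similarity_percent(227, 0, 0, 0))
```

Under these assumptions, a score of 1234 gives 99.7% and 1233 gives 99.6%, and 227 matches with one conservative substitution give 99.6% local similarity, in line with the figures printed for the hits.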



SUMMARIES 



Result          Query
   No.   Score  Match  Length  DB  ID       Description
     1    1233   99.6     255   4  S31866   Ig gamma-1 chain C
     2    1233   99.6     330   1  GHHU     Ig gamma-1 chain C
     3    1227   99.1     374   2  S69339   Ig heavy chain V r
     4    1180   95.3     234   2  PT0207   Ig gamma chain C r
     5    1146   92.6     377   2  A23511   Ig gamma-3 chain C
     6    1144   92.4     377   2  A60764   Ig gamma-3 chain C
     7  1142.5   92.3     326   1  G2HU     Ig gamma-2 chain C
     8    1135   91.7     327   1  G4HU     Ig gamma-4 chain C
     9    1121   90.5     289   1  G3HUWI   Ig gamma-3 heavy c
    10   918.5   74.2     323   1  GHRB     Ig gamma chain C r
    11   906.5   73.2     328   2  I47160   Ig gamma 2b chain
    12   906.5   73.2     328   2  I47159   Ig gamma 2a chain
    13   903.5   73.0     277   2  I47162
ALIGNMENTS



RESULT 1 
S31866 

Ig gamma-1 chain C region - synthetic
C;Species: synthetic
A;Note: Homo sapiens (man) gene engineered and expressed in Escherichia coli
C;Date: 06-Jan-1995 #sequence_revision 17-Mar-1997 #text_change 19-May-2000
C;Accession: S31866
R;Filpula, D.
submitted to the EMBL Data Library, February 1993
A;Description: Screening method for protein-protein interactions of cloned gene
products.
A;Reference number: S31866
A;Accession: S31866
A;Molecule type: mRNA
A;Residues: 1-255 <FIL>
A;Cross-references: EMBL:X70421; NID:g33068; PIDN:CAA49866.1; PID:g33069
C;Keywords: immunoglobulin
F;1-22/Region: Escherichia coli outer membrane protein A precursor
F;23-255/Region: human Ig gamma-1 chain C region



Query Match 99.6%; Score 1233; DB 4; Length 255; 

Best Local Similarity 100.0%; Pred. No. 6.8e-89; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  29 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 88

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  89 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 148

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 149 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 208

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 209 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 255

RESULT 2 
GHHU 

Ig gamma-1 chain C region - human
C;Species: Homo sapiens (man)
C;Date: 31-Jan-1981 #sequence_revision 18-Aug-1982 #text_change 16-Jul-1999
C;Accession: A93433; S36861; S33887; B90563; A90564; B91668; A91723; A02146
R;Ellison, J.W.; Berson, B.J.; Hood, L.E.
Nucleic Acids Res. 10, 4071-4079, 1982
A;Title: The nucleotide sequence of a human immunoglobulin C-gamma1 gene.
A;Reference number: A93433; MUID:82274238; PMID:6287432
A;Accession: A93433
A;Molecule type: DNA
A;Residues: 1-330 <ELL>
A;Cross-references: EMBL:Z17370
A;Note: this sequence has the G1m(17) allotypic marker, 97-Lys, and the G1m(1)
markers, 239-Asp and 241-Leu
A;Note: Lys-330 is removed after translation
R;Harris, L.J.
submitted to the EMBL Data Library, October 1992
A;Reference number: S33904
A;Accession: S36861
A;Molecule type: DNA
A;Residues: 2-330 <HAR>
A;Cross-references: EMBL:Z17370
R;Takahashi, N.; Ueda, S.; Obata, M.; Nikaido, T.; Nakai, S.; Honjo, T.
Cell 29, 671-679, 1982
A;Title: Structure of human immunoglobulin gamma genes: implications for
evolution of a gene family.
A;Reference number: S33887; MUID:83001943; PMID:6811139
A;Accession: S33887
A;Molecule type: DNA
A;Residues: 88-113;235-330 <TAK>
A;Cross-references: EMBL:Z17370

R;Cunningham, B.A.; Rutishauser, U.; Gall, W.E.; Gottlieb, P.D.; Waxdal, M.J.;
Edelman, G.M.
Biochemistry 9, 3161-3170, 1970
A;Title: The covalent structure of a human gammaG-immunoglobulin. VII. Amino
acid sequence of heavy-chain cyanogen bromide fragments H-1-H-4.
A;Reference number: A90563; MUID:71064024; PMID:5489771
A;Contents: myeloma protein Eu
A;Accession: B90563
A;Molecule type: protein
A;Residues: 1-96,'R',98-135 <CUN>
A;Note: this sequence has the G1m(3) marker, 97-Arg
R;Rutishauser, U.; Cunningham, B.A.; Bennett, C.; Konigsberg, W.H.; Edelman,
G.M.
Biochemistry 9, 3171-3181, 1970
A;Title: The covalent structure of a human gammaG-immunoglobulin. VIII. Amino
acid sequence of heavy-chain cyanogen bromide fragments H-5-H-7.
A;Reference number: A90564; MUID:71064025; PMID:5530842
A;Contents: Eu
A;Accession: A90564
A;Molecule type: protein
A;Residues: 136-154,'Q',156-165,'Q',167-176,'Q',178-194,'N',196-197,'D',
199-238,'E',240,'M',242-267,'DGEPE',273-329 <RUT>
A;Note: this sequence has the G1m(non-1) markers, 239-Glu and 241-Met
R;Ponstingl, H.; Hilschmann, N.
Hoppe-Seyler's Z. Physiol. Chem. 357, 1571-1604, 1976
A;Title: Die Primaerstruktur eines monoklonalen IgG1-Immunglobulins
(Myelomprotein Nie), III. Die chymotryptischen Peptide der H-Kette, Anordnung
der tryptischen Peptide und Diskussion der vollstaendigen Primaerstruktur.
A;Reference number: A91668; MUID:77070269; PMID:826475
A;Contents: myeloma protein Nie
A;Accession: B91668
A;Molecule type: protein
A;Residues: 1-34,'Q',36-96,'K',98-115,'Q',117-197,'D',199-238,'D',240,'L',
242-268,'E',270-271,'D',273-330 <PON>
A;Note: this sequence has the G1m(17) and G1m(1) markers
R;Schmidt, W.E.; Jung, H.D.; Palm, W.; Hilschmann, N.
Hoppe-Seyler's Z. Physiol. Chem. 364, 713-747, 1983
A;Title: Die Primaerstruktur des kristallisierbaren monoklonalen Immunglobulins
IgG1 KOL, I.
A;Reference number: A91723; MUID:83289131; PMID:6884994
A;Contents: myeloma protein KOL; disulfide bonds
A;Accession: A91723
A;Molecule type: protein
A;Residues: 1-96,'R',98-197,'D',199-238,'E',240,'M',242-266,'D',268-271,'D',
273-330 <SCH>
A;Note: this sequence has the G1m(3) and G1m(non-1) markers
R;Gall, W.E.; Edelman, G.M.
Biochemistry 9, 3188-3196, 1970
A;Title: The covalent structure of a human gammaG-immunoglobulin. X. Intrachain
disulfide bonds.
A;Reference number: A90565; MUID:71064027; PMID:4923144
A;Contents: annotation; disulfide bonds
R;Dreker, L.; Schwarz, J.; Reichel, W.; Hilschmann, N.
Hoppe-Seyler's Z. Physiol. Chem. 357, 1515-1540, 1976
A;Title: Rule of antibody structure. The primary structure of monoclonal IgG1
immunoglobulin (myeloma protein Nie), I: purification and characterization of
the protein, the L- and H-chains, the cyanogenbromide cleavage products, and the
disulfide bridges.
A;Reference number: A91667; MUID:77070267; PMID:1002129
A;Contents: annotation; disulfide bonds



C;Genetics:
A;Gene: GDB:IGHG1
A;Cross-references: GDB:120085; OMIM:147100
A;Map position: 14q32.33-14q32.33
A;Introns: 99/1; 114/1; 224/1
C;Complex: An immunoglobulin heterotetramer subunit consists of two identical
light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma,
or mu) chains usually stabilized by interchain disulfide bonds. In some cases,
such as IgA and IgM, the subunits associate into larger oligomers.
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin
F;20-85/Domain: immunoglobulin homology <IM1>
F;137-206/Domain: immunoglobulin homology <IM2>
F;243-310/Domain: immunoglobulin homology <IM3>
F;27-83,144-204,250-308/Disulfide bonds: #status experimental
F;103/Disulfide bonds: interchain (to light chain) #status experimental
F;109,112/Disulfide bonds: interchain (to heavy chain) #status experimental
F;180/Binding site: carbohydrate (Asn) (covalent) #status experimental

Query Match 99.6%; Score 1233; DB 1; Length 330;
Best Local Similarity 100.0%; Pred. No. 9.4e-89;
Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        104 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 163

Qy         62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        164 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 223

Qy        122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        224 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 283

Qy        182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              |||||||||||||||||||||||||||||||||||||||||||||||
Db        284 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 330



RESULT 3 
S69339 

Ig heavy chain V region precursor - human 
C;Species: Homo sapiens (man)
C;Date: 19-Mar-1997 #sequence_revision 19-Mar-1997 #text_change 01-Dec-2000
C;Accession: S69339; S72664
R;Khamlichi, A.A.; Aucouturier, P.; Preud'homme, J.L.; Cogne, M.
Eur. J. Biochem. 229, 54-60, 1995
A;Title: Structure of abnormal heavy chains in human heavy-chain-deposition
disease.
A;Reference number: S69339; MUID:95262687; PMID:7744049
A;Accession: S69339
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-374 <KHA>
A;Cross-references: EMBL:X81695
R;Khamlichi, A.A.
submitted to the EMBL Data Library, September 1994
A;Reference number: S72664
A;Accession: S72664
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-140,'C',142-374 <KH2>
A;Cross-references: EMBL:X81695
C;Superfamily: immunoglobulin C region; immunoglobulin homology

Query Match 99.1%; Score 1227; DB 2; Length 374;
Best Local Similarity 99.1%; Pred. No. 3.2e-88;
Matches 225; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy          2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        148 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 207

Qy         62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        208 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 267

Qy        122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
              |||||||||||||||:|:||||||||||||||||||||||||||||||||||||||||||
Db        268 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 327

Qy        182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              |||||||||||||||||||||||||||||||||||||||||||||||
Db        328 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 374
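
The percentages in these summary lines can be reproduced from the other figures in the listing. The sketch below is an inference from the numbers shown here, not taken from GenCore documentation: it assumes Query Match is Score divided by the run's Perfect score (1238), and Best Local Similarity is Matches divided by the total number of alignment columns. The function names are hypothetical.

```python
PERFECT_SCORE = 1238  # "Perfect score" reported in the run header


def query_match(score, perfect=PERFECT_SCORE):
    """Assumed definition: score as a percentage of the perfect score."""
    return round(100.0 * score / perfect, 1)


def best_local_similarity(matches, conservative, mismatches, indels):
    """Assumed definition: identical columns over all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# RESULT 3: Score 1227; Matches 225; Conservative 2; Mismatches 0; Indels 0
print(query_match(1227), best_local_similarity(225, 2, 0, 0))
```

With the RESULT 3 figures both functions return 99.1, which agrees with the Query Match and Best Local Similarity values printed for that hit.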



RESULT 4 
PT0207 

Ig gamma chain C region - chimpanzee 
C;Species: Pan troglodytes (chimpanzee)
C;Date: 23-Nov-1991 #sequence_revision 23-Nov-1991 #text_change 16-Jul-1999
C;Accession: PT0207
R;Ehrlich, P.H.; Moustafa, Z.A.; Oestberg, L.
Mol. Immunol. 28, 319-322, 1991
A;Title: Nucleotide sequence of chimpanzee Fc and hinge regions.
A;Reference number: PT0207; MUID:91287716; PMID:2062315
A;Accession: PT0207
A;Molecule type: mRNA
A;Residues: 1-234 <EHR>
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: immunoglobulin
F;48-117/Domain: immunoglobulin homology <IMM>

Query Match 95.3%; Score 1180; DB 2; Length 234;
Best Local Similarity 98.6%; Pred. No. 8.2e-85;
Matches 217; Conservative 1; Mismatches 2; Indels 0; Gaps 0;

Qy          2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
              | ||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db         15 DTTHTCPPCAAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 74

Qy         62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         75 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 134

Qy        122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
              |||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
Db        135 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYKTTPPVLDS 194

Qy        182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKS 221
              ||||||||||||||||||||||||||||||||||||||||
Db        195 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKS 234

RESULT 5 
A23511 

Ig gamma-3 chain C region (allotype G3m(b)) - human 
C;Species: Homo sapiens (man)
C;Date: 28-Dec-1987 #sequence_revision 28-Dec-1987 #text_change 23-Jul-1999
C;Accession: A23511
R;Huck, S.; Fort, P.; Crawford, D.H.; Lefranc, M.P.; Lefranc, G.
Nucleic Acids Res. 14, 1779-1789, 1986
A;Title: Sequence of a human immunoglobulin gamma 3 heavy chain constant region
gene: comparison with the other human C-gamma genes.
A;Reference number: A23511; MUID:86148507; PMID:3081877
A;Accession: A23511
A;Molecule type: DNA
A;Residues: 1-377 <HUC>
A;Cross-references: GB:X03604; GB:M12958; NID:g33070; PIDN:CAA27268.1;
PID:g577056
C;Genetics:
A;Gene: GDB:IGHG3
A;Cross-references: GDB:119339; OMIM:147120
A;Map position: 14q32.33-14q32.33
A;Introns: 98/3; 115/3; 130/3; 145/3; 160/3; 270/3
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: immunoglobulin
F;20-85/Domain: immunoglobulin homology <IMM>

Query Match 92.6%; Score 1146; DB 2; Length 377;
Best Local Similarity 92.5%; Pred. No. 6.7e-82;
Matches 210; Conservative 8; Mismatches 9; Indels 0; Gaps 0;

Qy          2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
              |    || |||||||||||||||||||||||||||||||||||||||||||||:| ||||
Db        151 DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFKWYVD 210

Qy         62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
              |||||||||||||||||||:||||||||||||||||||||||||||||||||||||| ||
Db        211 GVEVHNAKTKPREEQYNSTFRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTK 270

Qy        122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
              |||||||||||||||:|:|||||||||||||||||||||||||:|||||||| ||||:|||
Db        271 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 330

Qy        182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              |||||||||||||||||||||:|||||||||||| :|||||||||||
Db        331 DGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSPGK 377
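
For readers who want to tabulate the hits, the three summary lines of each result can be read into a dictionary. This is a minimal sketch, assuming every field follows the "Name value;" layout seen in this listing; `parse_summary` is a hypothetical helper, not part of GenCore.

```python
import re


def parse_summary(text):
    """Collect 'Name value;' pairs (for example 'Score 1146;') as floats."""
    fields = {}
    # A name is letters, dots and spaces; a value is a number, optionally
    # in exponent notation and optionally followed by a percent sign.
    pattern = r"([A-Za-z][A-Za-z. ]*?)\s+([0-9][0-9.e+-]*)%?;"
    for name, value in re.findall(pattern, text):
        fields[name.strip()] = float(value)
    return fields


summary = (
    "Query Match 92.6%; Score 1146; DB 2; Length 377;\n"
    "Best Local Similarity 92.5%; Pred. No. 6.7e-82;\n"
    "Matches 210; Conservative 8; Mismatches 9; Indels 0; Gaps 0;"
)
fields = parse_summary(summary)
print(fields["Score"], fields["Pred. No."], fields["Matches"])
```

Values that the text extraction has damaged (a digit replaced by a letter, for instance) will not match the pattern and are skipped rather than guessed.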



RESULT 6 
A60764 

Ig gamma-3 chain C region, form LAT - human
C;Species: Homo sapiens (man)
C;Date: 14-May-1993 #sequence_revision 14-May-1993 #text_change 16-Jul-1999
C;Accession: A60764
R;Huck, S.; Lefranc, G.; Lefranc, M.P.
Immunogenetics 30, 250-257, 1989
A;Title: A human immunoglobulin IGHG3 allele (Gmb0, b1, c3, c5, u) with an IGHG4
converted region and three hinge exons.
A;Reference number: A60764; MUID:90007613; PMID:2571587
A;Accession: A60764
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-377 <HUC>
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: immunoglobulin
F;20-85/Domain: immunoglobulin homology <IMM>

Query Match 92.4%; Score 1144; DB 2; Length 377;
Best Local Similarity 92.5%; Pred. No. 9.6e-82;
Matches 210; Conservative 8; Mismatches 9; Indels 0; Gaps 0;

Qy          2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
              |    || |||||||||||||||||||||||||||||||||||||||||||||:| ||||
Db        151 DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFKWYVD 210

Qy         62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
              |||||||||||||||||||:||||||||||||||||||||||||||||||||||||| ||
Db        211 GVEVHNAKTKPREEQYNSTFRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTK 270

Qy        122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
              |||||||||||||||:|:|||||||||||||||||||||||||:|||||||| ||||||||
Db        271 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPVLDS 330

Qy        182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              ||||||||:|||||||||:||||||||||||||| :|||||||||||
Db        331 DGSFFLYSRLTVDKSRWQEGNVFSCSVMHEALHNRFTQKSLSLSPGK 377



RESULT 7 
G2HU 

Ig gamma-2 chain C region - human
C;Species: Homo sapiens (man)
C;Date: 30-Apr-1981 #sequence_revision 13-Jun-1983 #text_change 21-Jul-2000
C;Accession: A93906; A92809; A90752; A93132; A02148
R;Ellison, J.; Hood, L.
Proc. Natl. Acad. Sci. U.S.A. 79, 1984-1988, 1982
A;Title: Linkage and sequence homology of two human immunoglobulin gamma heavy
chain constant region genes.
A;Reference number: A93906; MUID:82197621; PMID:6804948
A;Accession: A93906
A;Molecule type: DNA
A;Residues: 1-326 <ELL>
A;Cross-references: GB:V00554; GB:J00230; NID:g32759; PIDN:CAB58438.1;
PID:g6066056
A;Note: Lys-326 is probably removed posttranslationally
R;Wang, A.C.; Tung, E.; Fudenberg, H.H.
J. Immunol. 125, 1048-1054, 1980
A;Title: The primary structure of a human IgG2 heavy chain: genetic,
evolutionary, and functional implications.
A;Reference number: A92809; MUID:81007873; PMID:6774012
A;Contents: myeloma protein Til
A;Accession: A92809
A;Molecule type: protein
A;Residues: 1-19,'Q',21-57,'Z',59,'A',61-193,'D',195-325 <WAN>
A;Note: Trp-156 is at or near the complement-binding site
R;Connell, G.E.; Parr, D.M.; Hofmann, T.
Can. J. Biochem. 57, 758-767, 1979
A;Title: The amino acid sequences of the three heavy chain constant region
domains of a human IgG2 myeloma protein.
A;Reference number: A90752; MUID:80001357; PMID:113060
A;Contents: myeloma protein Zie
A;Accession: A90752
A;Molecule type: protein
A;Residues: 1-24,'E',26-57,'EV',60-85;132-171,'ZZZ',175,'B',177-193,'D',195-196,'Q',198-234,'Z',236-263,'BGEPZ',269-325 <CON>
A;Note: this sequence has since been revised
R;Hofmann, T.; Parr, D.M.
Mol. Immunol. 16, 923-925, 1979
A;Title: A note on the amino acid sequence of residues 381-391 of human
immunoglobulin gamma chains.
A;Reference number: A93132; MUID:80114419; PMID:118920
A;Contents: Zie
A;Accession: A93132
A;Molecule type: protein
A;Residues: 238-275 <HOF>
R;Hofmann, T.; Parr, D.M.
submitted to the Atlas, March 1980
A;Reference number: A94591
A;Contents: annotation; Zie, revisions to residues 25, 59, 60, and 264-268
A;Note: the revised sequence differs from that shown in having 60-Ala and in the
amidation states of residues 58, 194, and 197; the amidation states of residues
172-174, 176, and 235 were not determined
R;Milstein, C.; Frangione, B.
Biochem. J. 121, 217-225, 1971
A;Title: Disulphide bridges of the heavy chain of human immunoglobulin G2.
A;Reference number: A90253; MUID:72033500; PMID:4940472
A;Contents: annotation; myeloma protein Sa, disulfide bonds
R;Frangione, B.; Milstein, C.; Pink, J.R.L.
Nature 221, 145-148, 1969
A;Title: Structural studies of immunoglobulin G.
A;Reference number: A93157; MUID:69064124; PMID:5782707
A;Contents: annotation; Sa, disulfide bonds
C;Genetics:
A;Gene: GDB:IGHG2
A;Cross-references: GDB:119338; OMIM:147110
A;Map position: 14q32.33-14q32.33
C;Complex: An immunoglobulin heterotetramer subunit consists of two identical
light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma,
or mu) chains usually stabilized by interchain disulfide bonds. In some cases,
such as IgA and IgM, the subunits associate into larger oligomers.
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin
F;20-85/Domain: immunoglobulin homology <IM1>
F;133-202/Domain: immunoglobulin homology <IM2>
F;239-306/Domain: immunoglobulin homology <IM3>
F;14/Disulfide bonds: interchain (to light chain) #status experimental
F;27-83,140-200,246-304/Disulfide bonds: #status experimental
F;102,103,106,109/Disulfide bonds: interchain (to heavy chain) #status experimental
F;176/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 92.3%; Score 1142.5; DB 1; Length 326;
Best Local Similarity 94.1%; Pred. No. 1e-81;
Matches 209; Conservative 8; Mismatches 4; Indels 1; Gaps 1;

Qy          7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66
              |||||||  : |||||||||||||||||||||||||||||||||||||:|||||||||||
Db        106 CPPCPAPP-VAGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFNWYVDGVEVH 164

Qy         67 NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126
              ||||||||||:|||:||||||||:||||||||||||||||| ||||||||||| ||||||
Db        165 NAKTKPREEQFNSTFRVVSVLTVVHQDWLNGKEYKCKVSNKGLPAPIEKTISKTKGQPRE 224

Qy        127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186
              ||||||||||:|:||||||||||||||||||||||||||||||||||||||:||||||||
Db        225 PQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPMLDSDGSFF 284

Qy        187 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              ||||||||||||||||||||||||||||||||||||||||||
Db        285 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 326
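
The Matches and Indels figures can be cross-checked against an alignment block by counting columns. The sketch below is illustrative only: `column_counts` is a hypothetical helper, and the two rows are the first block of the RESULT 7 alignment (Qy 7-66 against Db 106-164) with the garbled residues restored. It does not separate conservative substitutions from mismatches, because that needs the BLOSUM62 scoring table.

```python
def column_counts(qy, db):
    """Count identical, substituted and gapped columns of two aligned rows."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have equal length")
    identical = substituted = gapped = 0
    for a, b in zip(qy, db):
        if a == "-" or b == "-":
            gapped += 1          # an indel column
        elif a == b:
            identical += 1       # counted under "Matches"
        else:
            substituted += 1     # conservative or mismatch
    return identical, substituted, gapped


qy = "CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH"
db = "CPPCPAPP-VAGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFNWYVDGVEVH"
print(column_counts(qy, db))
```

Summing the three counts over all four blocks of a hit should give the Matches, Conservative plus Mismatches, and Indels totals in its summary line.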



RESULT 8 
G4HU 

Ig gamma-4 chain C region - human 
C;Species: Homo sapiens (man) 

C;Date: 02-Apr-1982 #sequence_revision 02-Apr-1982 #text_change 16-Jul-1999 
C;Accession: A90933; A90249; A02150 
R;Ellison, J.; Buxbaum, J.; Hood, L. 
DNA 1, 11-18, 1981 

A;Title: Nucleotide sequence of a human immunoglobulin C-gamma4 gene. 
A;Reference number: A90933; MUID: 83157104 ; PMID: 6299662 
A;Accession: A90933
A;Molecule type: DNA
A;Residues: 1-327 <ELL>
A;Note: the sequence was determined from the germline gene
R;Pink, J.R.L.; Buttery, S.H.; De Vries, G.M.; Milstein, C.
Biochem. J. 117, 33-47, 1970
A;Title: Human immunoglobulin subclasses. Partial amino acid sequence of the
constant region of a gamma4 chain.
A;Reference number: A90249; MUID:70207560; PMID:4192699
A;Accession: A90249
A;Molecule type: protein
A;Residues: 1-30;81-326 <PIN>
C;Genetics:
A;Gene: GDB:IGHG4
A;Cross-references: GDB:119340; OMIM:147130
A;Map position: 14q32.33-14q32.33
A;Introns: 99/1; 111/1; 221/1
C;Complex: An immunoglobulin heterotetramer subunit consists of two identical
light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma,
or mu) chains usually stabilized by interchain disulfide bonds. In some cases,
such as IgA and IgM, the subunits associate into larger oligomers.
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin
F;20-85/Domain: immunoglobulin homology <IM1>
F;99-110/Region: hinge
F;134-203/Domain: immunoglobulin homology <IM2>
F;240-307/Domain: immunoglobulin homology <IM3>
F;14/Disulfide bonds: interchain (to light chain) #status experimental
F;27-83,141-201,247-305/Disulfide bonds: #status predicted
F;106,109/Disulfide bonds: interchain (to heavy chain) #status experimental
F;177/Binding site: carbohydrate (Asn) (covalent) #status predicted



Query Match 91.7%; Score 1135; DB 1; Length 327;
Best Local Similarity 93.7%; Pred. No. 4e-81;
Matches 208; Conservative 8; Mismatches 6; Indels 0; Gaps 0;

Qy          7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66
              || ||||| ||||||||||||||||||||||||||||||||| |||||:|||||||||||
Db        106 CPSCPAPEFLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVH 165

Qy         67 NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126
              ||||||||||:|||||||||||||||||||||||||||||| ||: ||||||||||||||
Db        166 NAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPRE 225

Qy        127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186
              |||||||||::|:|||||||||||||||||||||||||||||||||||||||||||||||
Db        226 PQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 285

Qy        187 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              |||:|||||||||:||||||||||||||||||||||||| ||
Db        286 LYSRLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLGK 327





RESULT 9 
G3HUWI 

Ig gamma-3 heavy chain disease proteins - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Dec-1979 #sequence_revision 23-Oct-1981 #text_change 16-Jul-1999 
C;Accession: A90442; A92219; A90198; A93915; A02149 
R;Frangione, B.; Rosenwasser, E.; Prelli, F.; Franklin, E.C. 
Biochemistry 19, 4304-4308, 1980 

A;Title: Primary structure of human gamma3 immunoglobulin deletion mutant: 
gamma3 heavy-chain disease protein Wis. 

A;Reference number: A90442; MUID:81021548; PMID:6774747
A;Contents: heavy chain disease protein Wis
A;Accession: A90442
A;Molecule type: protein
A;Residues: 1-289 <FRA>
A;Note: the molecule is a dimer linked by 12 disulfide bonds; it has an extra
interchain disulfide bond at position 7 in addition to the 11 normally present
in the hinge region
A;Note: this protein lacks most of the V region and all of the CH1 region.
Residue 12 corresponds to the beginning of the hinge region of normal gamma-3
chains



A;Note: the sequence of residues 42-76 was taken from the reference that follows 
R;Michaelsen, T.E.; Frangione, B.; Franklin, E.C. 
J. Biol. Chem. 252, 883-889, 1977 

A;Title: Primary structure of the 'hinge' region of human IgG3. Probable
quadruplication of a 15-amino acid residue basic unit.
A;Reference number: A92219; MUID:77118561; PMID:402363

A;Contents: normal gamma-3 chains, sequence corresponding to residues 12-97 of 
protein Wis 
A;Accession: A92219 
A;Molecule type: protein 
A;Residues: 12-97 <MIC> 

A;Note: the hinge region in gamma-3 chains is about four times as long as in
other gamma chains and contains three identical 15-residue segments (29-43,
44-58, and 59-73) preceded by a similar 17-residue segment (12-28)
A;Note: cysteines at positions 24, 27, 33, 39, 42, 48, 54, 57, 63, 69, and 72
form inter-heavy chain bonds
R;Wolfenstein-Todel, C.; Frangione, B.; Prelli, F.; Franklin, E.C.
Biochem. Biophys. Res. Commun. 71, 907-914, 1976
A;Title: The amino acid sequence of "heavy chain disease" protein ZUC. Structure
of the Fc fragment of immunoglobulin G3.
A;Reference number: A90198; MUID:77021516; PMID:823945
A;Contents: heavy chain disease protein Zuc, partial sequence corresponding to
residues 59-289 of protein Wis
A;Accession: A90198
A;Molecule type: protein
A;Residues: 59-125,'EB',128-226,228-289 <WOL>
A;Note: this protein lacks most of the V region, all of the CH1 region, and part
of the hinge compared with normal gamma-3 heavy chains
R;Alexander, A.; Steinmetz, M.; Barritault, D.; Frangione, B.; Franklin, E.C.;
Hood, L.; Buxbaum, J.N.
Proc. Natl. Acad. Sci. U.S.A. 79, 3260-3264, 1982
A;Title: gamma heavy chain disease in man: cDNA sequence supports partial gene
deletion model.
A;Reference number: A93915; MUID:82247835; PMID:6808505
A;Contents: heavy chain disease protein Omm
A;Accession: A93915
A;Molecule type: mRNA
A;Residues: 12-70;72-114;116-125,'E',127-133,'L',135-136,'E',138,'Y',140-154,'D',156-157,'N',159-181,'A',183-186;199-226,'N',228-278,'Y',280-289 <ALE>
A;Note: a carboxyl-terminal Lys is removed posttranslationally
A;Note: this sequence may represent an allelic form or another gamma chain
subclass

C; Comment: The heavy chain disease protein Wis is shown. 

C;Genetics:
A;Gene: GDB:IGHG3
A;Cross-references: GDB:119339; OMIM:147120
A;Map position: 14q32.33-14q32.33
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: duplication; glycoprotein; immunoglobulin; pyroglutamic acid
F;203-270/Domain: immunoglobulin homology <IMM>
F;1/Modified site: pyrrolidone carboxylic acid (Gln) #status experimental
F;6,140/Binding site: carbohydrate (Asn) (covalent) #status experimental

Query Match 90.5%; Score 1121; DB 1; Length 289;
Best Local Similarity 90.3%; Pred. No. 4.2e-80;
Matches 204; Conservative 13; Mismatches 9; Indels 0; Gaps 0;

Qy          2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
Db         64

Qy         62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
Db        124

Qy        122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
Db        184

Qy        182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227
Db        244




RESULT 10 
GHRB 

Ig gamma chain C region - rabbit 

C;Species: Oryctolagus cuniculus (domestic rabbit)
C;Date: 24-Apr-1984 #sequence_revision 15-Nov-1984 #text_change 16-Jul-1999
C;Accession: A91749; A90290; A93928; A90245; A94416; A02161
R;Bernstein, K.E.; Alexander, C.B.; Mage, R.G.
Immunogenetics 18, 387-397, 1983
A;Title: Nucleotide sequence of a rabbit IgG heavy chain from the recombinant
F-I haplotype.
A;Reference number: A91749; MUID:84030930; PMID:6313520
A;Accession: A91749
A;Molecule type: mRNA
A;Residues: 1-323 <BER>
A;Note: this sequence has the d12 allotypic marker, 104-Thr, and the e14 marker,
185-Thr
R;Pratt, D.M.; Mole, L.E.
Biochem. J. 151, 337-349, 1975
A;Title: Sequence studies on the constant region of the Fd sections of rabbit
immunoglobulin G of different allotype.
A;Reference number: A90290; MUID:76135469; PMID:1243651
A;Accession: A90290
A;Molecule type: protein
A;Residues: 1-47,'E',49-71,'PV',72-128 <PRA>

R;Martens, C.L.; Moore, K.W.; Steinmetz, M.; Hood, L.; Knight, K.L.
Proc. Natl. Acad. Sci. U.S.A. 79, 6018-6022, 1982
A;Title: Heavy chain genes of rabbit IgG; isolation of a cDNA encoding gamma
heavy chain and identification of two genomic C-gamma genes.
A;Reference number: A93928; MUID:83299917; PMID:6193512
A;Accession: A93928
A;Molecule type: mRNA
A;Residues: 88-103,'M',105-143,'E',145-184,'A',186,'E',188-266 <MAR>
A;Cross-references: GB:M16426; NID:g165111; PIDN:AAA31289.1; PID:g165112
A;Note: this sequence has the d11 allotypic marker, 104-Met, and the e15
allotypic marker, 185-Ala
R;Fruchter, R.G.; Jackson, S.A.; Mole, L.E.; Porter, R.R.
Biochem. J. 116, 249-259, 1970
A;Title: Sequence studies of the Fd section of the heavy chain of rabbit
immunoglobulin G.
A;Reference number: A90245; MUID:70110015; PMID:5461106
A;Accession: A90245
A;Molecule type: protein
A;Residues: 132-143,'E',145-161 <FRU>
R;Hill, R.L.; Lebovitz, H.E.; Fellows Jr., R.E.; Delaney, R.
in Gamma Globulins, Nobel Symp. 3, Killander, J., ed., pp. 109-127, Almqvist and
Wiksell, Stockholm, 1967
A;Reference number: A94416
A;Accession: A94416
A;Molecule type: protein
A;Residues: 129-131;155-172,'D',174-184,'A',186,'E',188-200,'D',202-217,'E',219-232,'Q',234-245,'D',247-255,'G',257-259,'D',261-265,'D',267-279,'W',281-283,'S',285-322 <HIL>
A;Note: this has the e15 allotypic marker, 185-Ala

C;Complex: An immunoglobulin heterotetramer subunit consists of two identical
light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma,
or mu) chains usually stabilized by interchain disulfide bonds. In some cases,
such as IgA and IgM, the subunits associate into larger oligomers.
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin
F;20-82/Domain: immunoglobulin homology <IM1>
F;130-199/Domain: immunoglobulin homology <IM2>
F;236-303/Domain: immunoglobulin homology <IM3>
F;173/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 74.2%; Score 918.5; DB 1; Length 323;
Best Local Similarity 71.7%; Pred. No. 2.9e-64;
Matches 167; Conservative 29; Mismatches 32; Indels 5; Gaps 2;

Qy          1 MDKT---HTC--PPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVK 55
              :|||    ||  | || ||||||||||:||||||||||||||||||||||||| :||||:
Db         91 VDKTVAPSTCSKPTCPPPELLGGPSVFIFPPKPKDTLMISRTPEVTCVVVDVSQDDPEVQ 150

Qy         56 FNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK 115
              | ||::  :|  |:   ||:|:||| |||| | : ||||| |||:|||| ||||||||||
Db        151 FTWYINNEQVRTARPPLREQQFNSTIRVVSTLPITHQDWLRGKEFKCKVHNKALPAPIEK 210

Qy        116 TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTT 175
              |||||:||| ||:|||: | |:||:   |||||:: |||||||:|||| ||: |:|||||
Db        211 TISKARGQPLEPKVYTMGPPREELSSRSVSLTCMINGFYPSDISVEWEKNGKAEDNYKTT 270

Qy        176 PPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              | |||||||:|||:||:|  | ||:|:||:||||||||||||||||:| ||||
Db        271 PAVLDSDGSYFLYNKLSVPTSEWQRGDVFTCSVMHEALHNHYTQKSISRSPGK 323



RESULT 11 
I47160
Ig gamma 2b chain constant region - pig (fragment)
C;Species: Sus scrofa domestica (domestic pig)
C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000
C;Accession: I47160
R;Kacskovics, I.; Sun, J.; Butler, J.E.
J. Immunol. 153, 3565-3573, 1994
A;Title: Five putative subclasses of swine IgG identified from the cDNA
sequences of a single animal.
A;Reference number: I47158; MUID:95015845; PMID:7930579
A;Accession: I47160
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-328 <KAC>
A;Cross-references: EMBL:U03780; NID:g433125; PIDN:AAA52218.1; PID:g433126
C;Genetics:
A;Gene: IgG2b
C;Superfamily: immunoglobulin C region; immunoglobulin homology
F;133-202/Domain: immunoglobulin homology <IMM>

Query Match 73.2%; Score 906.5; DB 2; Length 328;
Best Local Similarity 73.2%; Pred. No. 2.6e-63;
Matches 164; Conservative 29; Mismatches 28; Indels 3; Gaps 2;

Qy          7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66
              || ||| |    |||||:||||||||||||||:||||||||| |:|||:|:|||||||||
Db        106 CPICPACE-SPGPSVFIFPPKPKDTLMISRTPQVTCVVVDVSQENPEVQFSWYVDGVEVH 164

Qy         67 NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126
               |:|:|:|||:|||||||||| : |||||||||:||||:|| ||||| : ||||||| ||
Db        165 TAQTRPKEEQFNSTYRVVSVLPIQHQDWLNGKEFKCKVNNKDLPAPITRIISKAKGQTRE 224

Qy        127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ--PENNYKTTPPVLDSDGS 184
              ||||||||  :||::::||:|||| |||| || |||: |||  || ||:||||  | ||:
Db        225 PQVYTLPPHAEELSRSKVSITCLVIGFYPPDIDVEWQRNGQPEPEGNYRTTPPQQDVDGT 284

Qy        185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              :||||| :|||: || | :| |:||||||||||||||:| :|||
Db        285 YFLYSKFSVDKASWQGGGIFQCAVMHEALHNHYTQKSISKTPGK 328



RESULT 12 
I47159
Ig gamma 2a chain constant region - pig (fragment)
C;Species: Sus scrofa domestica (domestic pig)
C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000
C;Accession: I47159
R;Kacskovics, I.; Sun, J.; Butler, J.E.
J. Immunol. 153, 3565-3573, 1994
A;Title: Five putative subclasses of swine IgG identified from the cDNA
sequences of a single animal.
A;Reference number: I47158; MUID:95015845; PMID:7930579
A;Accession: I47159
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-328 <KAC>
A;Cross-references: EMBL:U03779; NID:g433123; PIDN:AAA52217.1; PID:g433124
C;Genetics:
A;Gene: IgG2a
C;Superfamily: immunoglobulin C region; immunoglobulin homology
F;133-202/Domain: immunoglobulin homology <IMM>

Query Match 73.2%; Score 906.5; DB 2; Length 328;
Best Local Similarity 73.2%; Pred. No. 2.6e-63;
Matches 164; Conservative 29; Mismatches 28; Indels 3; Gaps 2;

Qy          7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66
              || ||| |    |||||:||||||||||||||:||||||||| |:|||:|:|||||||||
Db        106 CPICPACE-SPGPSVFIFPPKPKDTLMISRTPQVTCVVVDVSQENPEVQFSWYVDGVEVH 164

Qy         67 NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126
               |:|:|:|||:|||||||||| : |||||||||:||||:|| ||||| : ||||||| ||
Db        165 TAQTRPKEEQFNSTYRVVSVLPIQHQDWLNGKEFKCKVNNKDLPAPITRIISKAKGQTRE 224

Qy        127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ--PENNYKTTPPVLDSDGS 184
              ||||||||  :||::::||:|||| |||| || |||: |||  || ||:||||  | ||:
Db        225 PQVYTLPPHAEELSRSKVSITCLVIGFYPPDIDVEWQRNGQPEPEGNYRTTPPQQDVDGT 284

Qy        185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              :||||| :|||: || | :| |:||||||||||||||:| :|||
Db        285 YFLYSKFSVDKASWQGGGIFQCAVMHEALHNHYTQKSISKTPGK 328

RESULT 13 
I47162
Ig gamma 4 chain constant region - pig (fragment)
C;Species: Sus scrofa domestica (domestic pig)
C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000
C;Accession: I47162
R;Kacskovics, I.; Sun, J.; Butler, J.E.
J. Immunol. 153, 3565-3573, 1994
A;Title: Five putative subclasses of swine IgG identified from the cDNA
sequences of a single animal.
A;Reference number: I47158; MUID:95015845; PMID:7930579
A;Accession: I47162
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-277 <KAC>
A;Cross-references: EMBL:U03782; NID:g433129; PIDN:AAA52220.1; PID:g433130
C;Genetics:
A;Gene: IgG4
C;Superfamily: immunoglobulin C region; immunoglobulin homology
F;82-151/Domain: immunoglobulin homology <IMM>

Query Match 73.0%; Score 903.5; DB 2; Length 277;
Best Local Similarity 71.1%; Pred. No. 3.6e-63;
Matches 167; Conservative 30; Mismatches 31; Indels 7; Gaps 4;

Qy          1 MDK---THTCPPCP-APELLG-GPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVK 55
              :||   | | ||||  |   | ||| |:|||||||||||||||:||||||||| |:|||:
Db         43 VDKRVGTKTKPPCPICPACEGPGPSAFIFPPKPKDTLMISRTPKVTCVVVDVSQENPEVQ 102

Qy         56 FNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK 115
              |:||||||||| |:|:|:|||:|||||||||| : |||||||||:||||:|| ||||| :
Db        103 FSWYVDGVEVHTAQTRPKEEQFNSTYRVVSVLPIQHQDWLNGKEFKCKVNNKDLPAPITR 162

Qy        116 TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ--PENNYK 173
               ||||||| ||||||||||  :||::::|:||||| |||| || |||: |||  || ||:
Db        163 IISKAKGQTREPQVYTLPPPTEELSRSKVTLTCLVTGFYPPDIDVEWQRNGQPEPEGNYR 222

Qy        174 TTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
              ||||  | ||::|||||| |||: ||:|: | |:||||||||||||||:  :|||
Db        223 TTPPQQDVDGTYFLYSKLAVDKASWQRGDTFQCAVMHEALHNHYTQKSIFKTPGK 277



RESULT 14 
G2GP 

Ig gamma-2 chain C region - guinea pig 
C;Species: Cavia porcellus (guinea pig)
C;Date: 07-May-1981 #sequence_revision 07-May-1981 #text_change 16-Jul-1999
C;Accession: A94553; A90352; A90359; A90384; A90385; A02151
R;Trischmann, T.M.
submitted to the Atlas, April 1975
A;Reference number: A94553
A;Accession: A94553
A;Molecule type: protein
A;Residues: 1-3 <TRI>
R;Birshtein, B.K.; Hussain, Q.Z.; Cebra, J.J.
Biochemistry 10, 18-25, 1971
A;Title: Structure of heavy chain from strain 13 guinea pig immunoglobulin-G(2).
III. Amino acid sequence of the region around the half-cystine joining heavy and
light chains.
A;Reference number: A90352; MUID:71058471; PMID:5538606
A;Accession: A90352
A;Molecule type: protein
A;Residues: 4-68 <BIR>
R;Turner, K.J.; Cebra, J.J.
Biochemistry 10, 9-17, 1971
A;Title: Structure of heavy chain from strain 13 guinea pig immunoglobulin-G(2).
II. Amino acid sequence of the carboxyl-terminal and hinge region cyanogen
bromide fragments.
A;Reference number: A90359; MUID:71058486; PMID:5538616
A;Accession: A90359
A;Molecule type: protein
A;Residues: 69-133;312-329 <TUR>

R;Tracey, D.E.; Cebra, J.J.
Biochemistry 13, 4796-4803, 1974
A;Title: Primary structure of the C-H2 homology region from guinea pig IgG2
antibodies.
A;Reference number: A90384; MUID:75036072; PMID:4429665
A;Accession: A90384
A;Molecule type: protein
A;Residues: 134-226 <TRA>
R;Trischmann, T.M.; Cebra, J.J.
Biochemistry 13, 4804-4811, 1974
A;Title: Primary structure of the C-H3 homology region from guinea pig IgG2
antibodies.
A;Reference number: A90385; MUID:75036073; PMID:4609467
A;Accession: A90385
A;Molecule type: protein
A;Residues: 227-311 <TR2>
R;Oliveira, B.; Lamm, M.E.
Biochemistry 10, 26-31, 1971
A;Title: Interchain disulfide bridges of guinea pig gamma-2-immunoglobulin.
A;Reference number: A90354; MUID:71058474; PMID:4922544
A;Contents: annotation; disulfide bonds
A;Note: Cys-16 is involved in a heavy-light chain bond
A;Note: Cys-105, Cys-107, and Cys-110 form inter-heavy chain bonds
C;Comment: This chain was isolated from pooled serum of strain 13 inbred guinea
pigs.

C;Complex: An immunoglobulin heterotetramer subunit consists of two identical
light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma,
or mu) chains usually stabilized by interchain disulfide bonds. In some cases,
such as IgA and IgM, the subunits associate into larger oligomers.
C;Superfamily: immunoglobulin C region; immunoglobulin homology
C;Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin
F;21-81/Domain: immunoglobulin homology <IM1>
F;135-204/Domain: immunoglobulin homology <IM2>
F;241-310/Domain: immunoglobulin homology <IM3>
F;28-79/Disulfide bonds: #status experimental
F;142-202/Disulfide bonds: #status experimental
F;178/Binding site: carbohydrate (Asn) (covalent) #status experimental
F;248-308/Disulfide bonds: #status experimental

Query Match 71.8%; Score 889; DB 1; Length 329; 

Best Local Similarity 72.3%; Pred. No. 6e-62; 

Matches 162; Conservative 24; Mismatches 36; Indels 2; Gaps 1; 
Qy 6 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEV 65

III II II llllllhllllMIIIIM II MINIMI ' U Mil I 

Db 106 TCPKCPPPENLGGPSVFIFPPKPKDTIJyiISLTPRVTCvVVDVSQDEPEVQFTWFVDNKPV 165 

Qy 66 HNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 125

Ihllll lllhhll III : Mill llhllll llllllllllllll II II 
Db 166 GNAETKPRVEQYNTTFRVESVLPIQHQDWLRGKEFKCKVYNKALPAPIEKTISKTKGAPR 225 

Qy 126 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP--ENNYKTTPPVLDSDG 183

I I I I I I I I I I I h I : : I h I I h hhll III II I I M I I 
Db 226 MPDVYTLPPSRDELSKSKVSVTCLIINFFPADIHVEWASNRVPVSEKEYKNTPPIEDADG 285

Qy 184 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227

hlMMMIMI I M hMMMIIMM I III 

Db 286 SYFLYSKLTVDKSAWDQGTVYTCSVMHEALHNHVTQKAISRSPG 329 
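The summary percentages printed above each alignment can be checked against the counts on the same lines. The sketch below is an interpretation rather than anything the report states: it assumes Query Match is the score divided by the perfect score, and that Best Local Similarity is the number of identical columns divided by all alignment columns (matches, conservative substitutions, mismatches, and indels). The function names are hypothetical. Under those assumptions the figures for the alignment above (Score 889 against a perfect score of 1238; 162 matches, 24 conservative, 36 mismatches, 2 indels) are reproduced.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Assumed definition: hit score as a percentage of the query's self-score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Assumed definition: identical columns as a percentage of all aligned columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Values taken from the guinea pig alignment summary above.
print(query_match_pct(889, 1238))                  # 71.8
print(best_local_similarity_pct(162, 24, 36, 2))   # 72.3
```

Both definitions are inferred from the numbers alone; they happen to agree with the printed values but are not documented in this output.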



RESULT 15 
147158 

Ig gamma 1 chain constant region - pig (fragment) 
C;Species: Sus scrofa domestica (domestic pig)
C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000
C;Accession: 147158
R;Kacskovics, I.; Sun, J.; Butler, J.E.
J. Immunol. 153, 3565-3573, 1994
A;Title: Five putative subclasses of swine IgG identified from the cDNA
sequences of a single animal.
A;Reference number: 147158; MUID:95015845; PMID:7930579
A;Accession: 147158
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-328 <KAC>
A;Cross-references: EMBL:U03778; NID:g433121; PIDN:AAA52216.1; PID:g433122
C;Genetics:
A;Gene: IgG1
C;Superfamily: immunoglobulin C region; immunoglobulin homology
F;133-202/Domain: immunoglobulin homology <IMM>

Query Match 71.5%; Score 885.5; DB 2; Length 328; 

Best Local Similarity 72.4%; Pred. No. 1.1e-61;

Matches 163; Conservative 27; Mismatches 32; Indels 3; Gaps 2; 



Qy   6 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEV 65
       ||| ||  | : |||||:||||||||||||:|||||||||||| |  ||:|:||||||||
Db 105 TCPICPGCE-VAGPSVFIFPPKPKDTLMISQTPEVTCVVVDVSKEHAEVQFSWYVDGVEV 163

Qy  66 HNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 125
       | |:|:|:|||:|||||||||| : ||||| |||:||||:|  ||||| :||||| || |
Db 164 HTAETRPKEEQFNSTYRVVSVLPIQHQDWLKGKEFKCKVNNVDLPAPITRTISKAIGQSR 223

Qy 126 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ--PENNYKTTPPVLDSDG 183
       |||||||||  :||::::|:||||| |||| || |||:||||  ||| |:||||  | ||
Db 224 EPQVYTLPPPAEELSRSKVTLTCLVIGFYPPDIHVEWKSNGQPEPENTYRTTPPQQDVDG 283

Qy 184 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       :||||||| |||:||  |: | |:||||||||||||||:| : ||
Db 284 TFFLYSKLAVDKARWDHGDKFECAVMHEALHNHYTQKSISKTQGK 328



Search completed: January 16, 2004, 09:39:52 
Job time : 22 secs






GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 16, 2004, 09:38:52 ; Search time 34 Seconds
(without alignments)
1371.227 Million cell updates/sec

Title: US-09-843-221A-2
Perfect score: 1238
Sequence: 1 MDKTHTCPPCPAPELLGGPS..........MHEALHNHYTQKSLSLSPGK 228

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 762491 seqs, 204481190 residues

Total number of hits satisfying chosen parameters: 762491

Minimum DB seq length: 0
Maximum DB seq length: 2000000000



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
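The throughput figure in the run header can be cross-checked from the other header values. This is a sketch under an assumption the report does not spell out: that a "cell update" is one dynamic-programming cell, so the total is query length times database residues, divided by the search time. The function name is hypothetical. With a 228-residue query, 204481190 database residues, and 34 seconds, the assumption reproduces the printed 1371.227.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float) -> float:
    """Assumed definition: (query length x database residues) / time, in millions."""
    cells = query_len * db_residues  # one DP cell per query/database residue pair
    return cells / seconds / 1e6


rate = million_cell_updates_per_sec(228, 204_481_190, 34)
print(f"{rate:.3f}")  # 1371.227
```

The same arithmetic applied to the other runs in this report gives values that agree with their headers, which supports the assumption without confirming it.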



Database : Published_Applications_AA:*

1: /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep:* 

2 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep: * 

3 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep: * 

4 : /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB.pep: * 

5 : /cgn2_6/ptodata/1/pubpaa/US07_NEW_PUB.pep:*

6 : /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB.pep: 

7 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB.pep: * 

8 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 

9: /cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB.pep: 

10: /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep 

11 : /cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep 

12 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: 

13 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep 

14 : /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep 

15 : /cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep 

16 : /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep 

17 : /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep 

18 : /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 
US-09-847-712-2 

; Sequence 2, Application US/09847712
; Patent No. US20020090646A1
; GENERAL INFORMATION:
; APPLICANT: LIU, CHUAN-FA
; APPLICANT: MARSHALL, WILLIAM S.
; APPLICANT: REYNOLDS, ANGELA
; TITLE OF INVENTION: CALCITONIN-RELATED MOLECULES
; FILE REFERENCE: A-684
; CURRENT APPLICATION NUMBER: US/09/847,712
; CURRENT FILING DATE: 2001-05-02
; PRIOR APPLICATION NUMBER: 60/201,511
; PRIOR FILING DATE: 2000-05-03
; NUMBER OF SEQ ID NOS: 26
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 2
; LENGTH: 228
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-847-712-2

Query Match 100.0%; Score 1238; DB 9; Length 228;
Best Local Similarity 100.0%; Pred. No. 1.9e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 2 
US-09-840-277-2 

; Sequence 2, Application US/09840277
; Patent No. US20020168363A1
; GENERAL INFORMATION:
; APPLICANT: FEIGE, ULRICH
; APPLICANT: KOHNO, TADAHIKO
; APPLICANT: LACEY, DAVID LEE
; APPLICANT: BOONE, THOMAS CHARLES
; TITLE OF INVENTION: INTEGRIN/ADHESION ANTAGONISTS
; FILE REFERENCE: A-688A
; CURRENT APPLICATION NUMBER: US/09/840,277
; CURRENT FILING DATE: 2001-08-14
; PRIOR APPLICATION NUMBER: 60/198,919
; PRIOR FILING DATE: 2000-04-21
; PRIOR APPLICATION NUMBER: 60/201,394
; PRIOR FILING DATE: 2000-05-03
; NUMBER OF SEQ ID NOS: 135
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 2
; LENGTH: 228
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-840-277-2

Query Match 100.0%; Score 1238; DB 10; Length 228;
Best Local Similarity 100.0%; Pred. No. 1.9e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228

RESULT 3 

US-09-847-249A-2 

; Sequence 2, Application US/09847249A
; Publication No. US20030032588A1
; GENERAL INFORMATION:
; APPLICANT: MARSHALL, WILLIAM S.
; APPLICANT: STARK, KEVIN LEE
; TITLE OF INVENTION: GLUCAGON ANTAGONIST
; FILE REFERENCE: A-693
; CURRENT APPLICATION NUMBER: US/09/847,249A
; CURRENT FILING DATE: 2001-05-02
; PRIOR APPLICATION NUMBER: 60/201,436
; PRIOR FILING DATE: 2000-05-03
; NUMBER OF SEQ ID NOS: 80
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 2
; LENGTH: 228
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-847-249A-2

Query Match 100.0%; Score 1238; DB 11; Length 228;
Best Local Similarity 100.0%; Pred. No. 1.9e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228

RESULT 4 

US-09-843-221A-2 

; Sequence 2, Application US/09843221A
; Publication No. US20030039654A1
; GENERAL INFORMATION:
; APPLICANT: KOSTENUIK, PAUL
; APPLICANT: LIU, CHUAN-FA
; APPLICANT: LACEY, DAVID LEE
; TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND PARATHYROID HORMONE-
; TITLE OF INVENTION: RELATED PROTEIN
; FILE REFERENCE: A-665B
; CURRENT APPLICATION NUMBER: US/09/843,221A
; CURRENT FILING DATE: 2001-04-26
; PRIOR APPLICATION NUMBER: 60/266,673
; PRIOR FILING DATE: 2001-02-06
; PRIOR APPLICATION NUMBER: 60/214,860
; PRIOR FILING DATE: 2000-06-28
; PRIOR APPLICATION NUMBER: 60/200,053
; PRIOR FILING DATE: 2000-04-27
; NUMBER OF SEQ ID NOS: 170
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 2
; LENGTH: 228
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-843-221A-2

Query Match 100.0%; Score 1238; DB 11; Length 228;
Best Local Similarity 100.0%; Pred. No. 1.9e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 5 

US-09-840-669B-2 

; Sequence 2, Application US/09840669B
; Publication No. US20030040470A1
; GENERAL INFORMATION:
; APPLICANT: KOHNO, TADAHIKO
; TITLE OF INVENTION: APO-AI/AII PEPTIDE DERIVATIVES
; FILE REFERENCE: A-690
; CURRENT APPLICATION NUMBER: US/09/840,669B
; CURRENT FILING DATE: 2002-06-07
; PRIOR APPLICATION NUMBER: 60/198,920
; PRIOR FILING DATE: 2000-04-21
; NUMBER OF SEQ ID NOS: 11
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 2
; LENGTH: 228
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-840-669B-2

Query Match 100.0%; Score 1238; DB 11; Length 228;
Best Local Similarity 100.0%; Pred. No. 1.9e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 6 

US-10-269-806-32 

; Sequence 32, Application US/10269806
; Publication No. US20030176352A1
; GENERAL INFORMATION:
; APPLICANT: Min, Hosung
; APPLICANT: Sitney, Karen
; APPLICANT: Hartley, Cynthia
; TITLE OF INVENTION: Peptides and Related Compounds Having Thrombopoietic Activity
; FILE REFERENCE: A-750
; CURRENT APPLICATION NUMBER: US/10/269,806
; CURRENT FILING DATE: 2002-10-10
; NUMBER OF SEQ ID NOS: 199
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 32
; LENGTH: 228
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: Synthesized Peptide Sequence
US-10-269-806-32

Query Match 100.0%; Score 1238; DB 12; Length 228;
Best Local Similarity 100.0%; Pred. No. 1.9e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 7 
US-10-145-206-2 

; Sequence 2, Application US/10145206
; Publication No. US20030195156A1
; GENERAL INFORMATION:
; APPLICANT: MIN, HOSUNG
; APPLICANT: HSU, HAILING
; APPLICANT: ZIONG, FEI
; TITLE OF INVENTION: PEPTIDES AND RELATED MOLECULES THAT BIND TO TALL-1
; FILE REFERENCE: A-743
; CURRENT APPLICATION NUMBER: US/10/145,206
; CURRENT FILING DATE: 2002-05-13
; PRIOR APPLICATION NUMBER: US 60/290,196
; PRIOR FILING DATE: 2001-05-11
; NUMBER OF SEQ ID NOS: 197
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 2
; LENGTH: 228
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-145-206-2

Query Match 100.0%; Score 1238; DB 12; Length 228;
Best Local Similarity 100.0%; Pred. No. 1.9e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228

RESULT 8 

US-10-269-806-152 

; Sequence 152, Application US/10269806
; Publication No. US20030176352A1
; GENERAL INFORMATION:
; APPLICANT: Min, Hosung
; APPLICANT: Sitney, Karen
; APPLICANT: Hartley, Cynthia
; TITLE OF INVENTION: Peptides and Related Compounds Having Thrombopoietic Activity
; FILE REFERENCE: A-750
; CURRENT APPLICATION NUMBER: US/10/269,806
; CURRENT FILING DATE: 2002-10-10
; NUMBER OF SEQ ID NOS: 199
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 152
; LENGTH: 243
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: Synthesized Peptide Sequence
US-10-269-806-152

Query Match 100.0%; Score 1238; DB 12; Length 243;
Best Local Similarity 100.0%; Pred. No. 2e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 9 

US-10-269-695-243 

; Sequence 243, Application US/10269695
; Publication No. US20030229023A1
; GENERAL INFORMATION:
; APPLICANT: OLINER, JONATHAN DANIEL
; APPLICANT: MIN, HOSUNG
; TITLE OF INVENTION: SPECIFIC BINDING AGENTS OF HUMAN ANGIOPOIETIN-2
; FILE REFERENCE: A-801A
; CURRENT APPLICATION NUMBER: US/10/269,695
; CURRENT FILING DATE: 2002-10-10
; PRIOR APPLICATION NUMBER: US 60/414,155
; PRIOR FILING DATE: 2002-09-27
; PRIOR APPLICATION NUMBER: US 60/328,624
; PRIOR FILING DATE: 2001-10-11
; NUMBER OF SEQ ID NOS: 359
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 243
; LENGTH: 250
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: DNA encoding peptide capable of binding to Ang-2
US-10-269-695-243

Query Match 100.0%; Score 1238; DB 12; Length 250;
Best Local Similarity 100.0%; Pred. No. 2.1e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 10 
US-10-410-998-243 

; Sequence 243, Application US/10410998
; Publication No. US20030236193A1
; GENERAL INFORMATION:
; APPLICANT: OLINER, JONATHAN DANIEL
; APPLICANT: MIN, HOSUNG
; TITLE OF INVENTION: SPECIFIC BINDING AGENTS OF HUMAN ANGIOPOIETIN-2
; FILE REFERENCE: A-801A
; CURRENT APPLICATION NUMBER: US/10/410,998
; CURRENT FILING DATE: 2003-04-09
; PRIOR APPLICATION NUMBER: US/10/269,695
; PRIOR FILING DATE: 2002-10-10
; PRIOR APPLICATION NUMBER: US 60/414,155
; PRIOR FILING DATE: 2002-09-27
; PRIOR APPLICATION NUMBER: US 60/328,624
; PRIOR FILING DATE: 2001-10-11
; NUMBER OF SEQ ID NOS: 359
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 243
; LENGTH: 250
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: DNA encoding peptide capable of binding to Ang-2
US-10-410-998-243

Query Match 100.0%; Score 1238; DB 12; Length 250;
Best Local Similarity 100.0%; Pred. No. 2.1e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 11 
US-09-840-277-109 

; Sequence 109, Application US/09840277
; Patent No. US20020168363A1
; GENERAL INFORMATION:
; APPLICANT: FEIGE, ULRICH
; APPLICANT: KOHNO, TADAHIKO
; APPLICANT: LACEY, DAVID LEE
; APPLICANT: BOONE, THOMAS CHARLES
; TITLE OF INVENTION: INTEGRIN/ADHESION ANTAGONISTS
; FILE REFERENCE: A-688A
; CURRENT APPLICATION NUMBER: US/09/840,277
; CURRENT FILING DATE: 2001-08-14
; PRIOR APPLICATION NUMBER: 60/198,919
; PRIOR FILING DATE: 2000-04-21
; PRIOR APPLICATION NUMBER: 60/201,394
; PRIOR FILING DATE: 2000-05-03
; NUMBER OF SEQ ID NOS: 135
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 109
; LENGTH: 282
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: Echistatin Fc-peptide
; NAME/KEY: misc_feature
; LOCATION: (1)..(1)
; OTHER INFORMATION: NdeI site
; NAME/KEY: misc_feature
; LOCATION: (854)..(854)
; OTHER INFORMATION: BamHI site
US-09-840-277-109

Query Match 100.0%; Score 1238; DB 10; Length 282;
Best Local Similarity 100.0%; Pred. No. 2.5e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228





RESULT 12 
US-09-389-782-8 

; Sequence 8, Application US/09389782
; Publication No. US20030144187A1
; GENERAL INFORMATION:
; APPLICANT: Wooden, Scott K.
; APPLICANT: Mann, Michael B.
; APPLICANT: Dunstan, Colin R.
; TITLE OF INVENTION: OPG Fusion Protein Compositions and Methods
; FILE REFERENCE: A-604
; CURRENT APPLICATION NUMBER: US/09/389,782
; CURRENT FILING DATE: 1999-09-03
; NUMBER OF SEQ ID NOS: 50
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 8
; LENGTH: 401
; TYPE: PRT
; ORGANISM: Human
US-09-389-782-8

Query Match 100.0%; Score 1238; DB 12; Length 401;
Best Local Similarity 100.0%; Pred. No. 3.8e-99;
Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228



RESULT 13 
US-10-215-297-2 

; Sequence 2, Application US/10215297
; Publication No. US20020192222A1
; GENERAL INFORMATION:
; APPLICANT: Blumberg, Richard S.
; APPLICANT: Simister, Neil E.
; APPLICANT: Lencer, Wayne I.
; TITLE OF INVENTION: RECEPTOR SPECIFIC TRANSEPITHELIAL TRANSPORT OF
; TITLE OF INVENTION: THERAPEUTICS
; FILE REFERENCE: S1383/7003
; CURRENT APPLICATION NUMBER: US/10/215,297
; CURRENT FILING DATE: 2002-08-08
; PRIOR APPLICATION NUMBER: US 08/578,171
; PRIOR FILING DATE: 1995-12-29
; PRIOR APPLICATION NUMBER: US 08/374,159
; PRIOR FILING DATE: 1995-01-17
; PRIOR APPLICATION NUMBER: US 09/122,144
; PRIOR FILING DATE: 1998-07-24
; NUMBER OF SEQ ID NOS: 22
; SEQ ID NO 2
; LENGTH: 229
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-215-297-2

Query Match 99.7%; Score 1234; DB 14; Length 229;
Best Local Similarity 99.6%; Pred. No. 4.2e-99;
Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 61

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  62 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 121

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 122 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 181

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 182 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 229
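The middle line of each alignment block marks how every query residue relates to the database residue below it. As a minimal sketch, assuming `|` denotes identity, `:` denotes a conservative substitution (taken here to mean a positive BLOSUM62 score), and a blank denotes a mismatch or gap, the marker line can be rebuilt from the two sequences. `POSITIVE_PAIRS` and `match_line` are hypothetical names, and the set holds only the one pair needed for this alignment (M/V, which BLOSUM62 scores as +1); a real implementation would consult the full matrix.

```python
# Hypothetical excerpt: only the substitution pair needed for this example.
# BLOSUM62 scores M/V as +1; every other non-identical pair is treated as a
# mismatch here, which is NOT true of the full matrix.
POSITIVE_PAIRS = {frozenset("MV")}


def match_line(qy: str, db: str) -> str:
    """Build the marker line between two equal-length aligned strings."""
    marks = []
    for q, d in zip(qy, db):
        if q == "-" or d == "-":
            marks.append(" ")   # gap column
        elif q == d:
            marks.append("|")   # identical residues
        elif frozenset((q, d)) in POSITIVE_PAIRS:
            marks.append(":")   # conservative substitution (assumed meaning)
        else:
            marks.append(" ")   # mismatch
    return "".join(marks)


# First ten columns of the alignment above: query M against database V.
print(match_line("MDKTHTCPPC", "VDKTHTCPPC"))  # :|||||||||
```

This agrees with the summary line for the hit above, which reports 227 matches and 1 conservative substitution.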



RESULT 14 
US-10-215-298-2 

; Sequence 2, Application US/10215298
; Publication No. US20030012789A1
; GENERAL INFORMATION:
; APPLICANT: Blumberg, Richard S.
; APPLICANT: Simister, Neil E.
; APPLICANT: Lencer, Wayne I.
; TITLE OF INVENTION: RECEPTOR SPECIFIC TRANSEPITHELIAL TRANSPORT OF THERAPEUTICS
; FILE REFERENCE: S1383/7003
; CURRENT APPLICATION NUMBER: US/10/215,298
; CURRENT FILING DATE: 2002-08-08
; PRIOR APPLICATION NUMBER: US 09/122,144
; PRIOR FILING DATE: 1998-07-24
; NUMBER OF SEQ ID NOS: 22
; SEQ ID NO 2
; LENGTH: 229
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-215-298-2

Query Match 99.7%; Score 1234; DB 15; Length 229;
Best Local Similarity 99.6%; Pred. No. 4.2e-99;
Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   2 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 61

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  62 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 121

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 122 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 181

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 182 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 229





RESULT 15 
US-10-145-206-111 



; Sequence 111, Application US/10145206
; Publication No. US20030195156A1
; GENERAL INFORMATION:
; APPLICANT: MIN, HOSUNG
; APPLICANT: HSU, HAILING
; APPLICANT: ZIONG, FEI
; TITLE OF INVENTION: PEPTIDES AND RELATED MOLECULES THAT BIND TO TALL-1
; FILE REFERENCE: A-743
; CURRENT APPLICATION NUMBER: US/10/145,206
; CURRENT FILING DATE: 2002-05-13
; PRIOR APPLICATION NUMBER: US 60/290,196
; PRIOR FILING DATE: 2001-05-11
; NUMBER OF SEQ ID NOS: 197
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 111
; LENGTH: 248
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: TALL-1 inhibitory peptibodies
US-10-145-206-111

Query Match 99.7%; Score 1234; DB 12; Length 248;
Best Local Similarity 99.6%; Pred. No. 4.6e-99;
Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60
       :|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  21 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 80

Qy  61 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  81 DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 140

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 141 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 200

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||||||||
Db 201 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 248



Search completed: January 16, 2004, 09:46:30 
Job time : 34 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd.



OM protein - protein search, using sw model 



Run on: January 16, 2004, 09:36:27 ; Search time 36 Seconds
(without alignments)
1634.333 Million cell updates/sec

Title: US-09-843-221A-2
Perfect score: 1238
Sequence: 1 MDKTHTCPPCPAPELLGGPS..........MHEALHNHYTQKSLSLSPGK 228

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 830525 seqs, 258052604 residues

Total number of hits satisfying chosen parameters: 830525

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database : SPTREMBL_23:*
1: sp_archea:*
2: sp_bacteria:*
3: sp_fungi:*
4: sp_human:*
5: sp_invertebrate:*
6: sp_mammal:*
7: sp_mhc:*
8: sp_organelle:*
9: sp_phage:*
10: sp_plant:*
11: sp_rodent:*
12: sp_virus:*
13: sp_vertebrate:*
14: sp_unclassified:*
15: sp_rvirus:*
16: sp_bacteriap:*
17: sp_archeap:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result           Query
   No.   Score   Match  Length  DB  ID          Description

     1    1233    99.6     471   4  Q8TC77      Q8tc77 homo sapien
     2    1233    99.6     701   4  Q96PQ8      Q96pq8 homo sapien
     3    1142    92.2     521   4  Q8N4Y9      Q8n4y9 homo sapien
     4    1135    91.7     473   4  Q8TC63      Q8tc63 homo sapien
     5    1131    91.4     509   4  OflNRI 7    Q8nfl7 homo sapien
     6     909    73.4     337      Q95M34      Q95m34 equus cabal
     7   831.5    67.2     469  11  X\.«J V J   yO I J V j LULlo UlUbCUXU
     8   828.5    66.9     463  11  Q99LC4      Q99lc4 mus musculu
     9   824.5    66.6     437  11  Q9R1A4      Q9r1a4 mus musculu
    10     811    65.5     473  11  Q9D8L4      Q9d8l4 mus musculu
    11     805    65.0     468  11  Q99L31      Q99l31 mus musculu
    12     805    65.0     473  11  Q99L25      Q99l25 mus musculu
    13     780    63.0     473  11  Q91Z05      Q91z05 mus musculu
    14     780    63.0     474  11  Q8R3H6      Q8r3h6 mus musculu
    15     358    28.9     375   4  Q9BSZ1      Q9bsz1 homo sapien
    16     358    28.9     597   4  Q9BU10      Q9bu10 homo sapien
    17     358    28.9     597   4  Q9BQB8      Q9bqb8 homo sapien
    18     358    28.9     597   4  Q96BB9      Q96bb9 homo sapien
    19     348    28.1     588   4  Q8WUX4      Q8wux4 homo sapien
    20     348    28.1     613   4  Q96EY0      Q96ey0 homo sapien
    21     348    28.1     613   4  Q8WUK1      Q8wuk1 homo sapien
    22     348    28.1     614   4  Q96GA6      Q96ga6 homo sapien
    23     348    28.1     618   4  Q96AA6      Q96aa6 homo sapien
    24     347    28.0     613  11  Q8VCX7      Q8vcx7 mus musculu
    25   272.5    22.0     684  13  Q90544      Q90544 ginglymosto


26 


272 


22 , 


. 0 


384 


4 


O9UP60 


Q9up60 homo 


sapien 


27 


272 


22 . 


. 0 


493 


4 


08NCL6 


Q8ncl6 homo 


sapien 


28 


272 


22 . 


. 0 


494 


4 


096K68 


Q96k68 homo 


sapien 


29 


272 


22 . 


. 0 


496 


4 




Q96kx8 homo 


sapien 


30 


269 


21 . 


, 7 


4 96 


4 




Q96dk0 homo 


sapien 


31 


269 


21 . 


, 7 


4 99 


4 




Q8n5k4 homo 


sapien 


32 


267 


21 . 


, 6 


416 


4 


09NPP6 


Q9npp6 homo 


sapien 


33 


267 


21 . 


. 6 


4 97 


4 




Q8wy24 homo 


sapien 


34 


267 


21 . 


, 6 


500 


4 


O9RRV0 


Q9brv0 homo 


sapien 


35 


261 


21 . 


, l 


486 


11 


Q91Z07 


Q91z07 mus 


musculu 


36 


261 


21 . 


, l 


487 


11 


Q9 9KA4 


Q99ka4 mus 


musculu 


37 


260 


21 . 


. 0 


426 


11 


Q9DCD9 


Q9dcd9 mus 


musculu 


38 


253 . 5 


20 . 


. 5 


481 


11 


Q91WT3 


Q91wt3 mus 


musculu 


39 


253 .5 


20 . 


5 


481 


11 


Q91WT1 


Q91wtl mus 


musculu 


40 


253.5 


20. 


.5 


482 


11 


Q91X92 


Q91x92 mus 


musculu 


41 


253 .5 


20. 


5 


482 


11 


Q8K172 


Q8kl72 mus 


musculu 


42 


253 .5 


20. 


5 


484 


11 


Q8VEA0 


Q8vea0 mus 


musculu 


43 


253 .5 


20. 


5 


488 


11 


Q91WR1 


Q91wrl mus 


musculu 


44 


253.5 


20. 


5 


488 


11 


Q8K0F2 


Q8k0f2 mus 


musculu 


45 


253.5 


20. 


5 


489 


11 


Q8VCX4 


Q8vcx4 mus 


musculu 



ALIGNMENTS 



RESULT 1 
Q8TC77 

ID Q8TC77 PRELIMINARY; PRT; 471 AA. 

AC Q8TC77; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created)
DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Hypothetical protein.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Spleen;
RA Strausberg R.;
RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC024289; AAH24289.1;

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 4. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 2. 

KW Hypothetical protein. 

SQ SEQUENCE 471 AA; 51791 MW; 388F7F4CF588660E CRC64;

Query Match 99.6%; Score 1233; DB 4; Length 471;
Best Local Similarity 100.0%; Pred. No. 3.9e-108;
Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 245 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 304

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 305 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 364

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 365 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 424

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 425 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 471



RESULT 2 
Q96PQ8 

ID Q96PQ8 PRELIMINARY; PRT; 701 AA. 

AC Q96PQ8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Factor VII active site mutant immunoconjugate.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=21477448; PubMed=11593034;
RA Hu Z., Garen A.;
RT "Targeting tissue factor on tumor vascular endothelial cells and tumor
RT cells for immunotherapy in mouse models of prostatic cancer.";
RL Proc. Natl. Acad. Sci. U.S.A. 98:12180-12185(2001).
CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY S1.
DR EMBL; AF272774; AAK58686.1; -.
DR HSSP; P00761; 1AN1.



DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001314; Chymotrypsin.
DR InterPro; IPR000742; EGF_2.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR001438; EGF_II.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR002383; GLA_blood.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003597; Ig_c1.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR001254; Ser_protease_Try.
DR InterPro; IPR000294; VitK_dep_GLA.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF00594; gla; 1.
DR Pfam; PF00047; ig; 2.
DR Pfam; PF00089; trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR PRINTS; PR00010; EGFBLOOD.
DR PRINTS; PR00001; GLABLOOD.
DR SMART; SM00179; EGF_CA; 1.
DR SMART; SM00069; GLA; 1.
DR SMART; SM00407; IGc1; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS00011; GLU_CARBOXYLATION;
DR PROSITE; PS50835; IG_LIKE; 2.
DR PROSITE; PS00290; IG_MHC; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
KW EGF-like domain; Hydrolase; Protease; Serine protease.
SQ SEQUENCE 701 AA; 77826 MW; 94AC6CEB42CC992F CRC64;

Query Match 99.6%; Score 1233; DB 4; Length 701;
Best Local Similarity 100.0%; Pred. No. 6.4e-108;
Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 475 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 534

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 535 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 594

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 595 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 654

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 655 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 701

RESULT 3 
Q8N4Y9 

ID Q8N4Y9 PRELIMINARY; PRT; 521 AA. 

AC Q8N4Y9; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Primary B-Cells from Tonsils;
RA Strausberg R.;
RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC033178; AAH33178.1; -.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003597; Ig_c1.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 4.
DR SMART; SM00407; IGc1; 3.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 2.
KW Hypothetical protein.
SQ SEQUENCE 521 AA; 57156 MW; 2AC7D22E72D6CAA2 CRC64;

Query Match 92.2%; Score 1142; DB 4; Length 521; 

Best Local Similarity 92.1%; Pred. No. 1.8e-99; 

Matches 209; Conservative 9; Mismatches 9; Indels 0; Gaps 0; 

Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       |    || |||||||||||||||||||||||||||||||||||||||||||||:| ||||
Db 295 DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFKWYVD 354

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       |||||||||||||||:|||:|||||||||||||||||||||||||||||||||||||| |
Db 355 GVEVHNAKTKPREEQFNSTFRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTK 414

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       |||||||||||||||:|:|||||||||||||||||||||||||:||||||| ||||:|||
Db 415 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 474

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||:|||||||||||| :|||||||||||
Db 475 DGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSPGK 521



RESULT 4
Q8TC63
ID Q8TC63 PRELIMINARY; PRT; 473 AA.
AC Q8TC63;
DT 01-JUN-2002 (TrEMBLrel. 21, Created)
DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Hypothetical protein.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Kidney;
RA Strausberg R.;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC025985; AAH25985.1;
DR InterPro; IPR000923; BlueCu_1.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 4.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS00196; COPPER_BLUE;
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 3.
KW Hypothetical protein.
SQ SEQUENCE 473 AA; 51986 MW; E29920B09BA369F5 CRC64;

Query Match 91.7%; Score 1135; DB 4; Length 473;
Best Local Similarity 93.7%; Pred. No. 7.1e-99;
Matches 208; Conservative 8; Mismatches 6; Indels 0; Gaps 0;



Qy   7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66
       || ||||| ||||||||||||||||||||||||||||||||| |||||:|||||||||||
Db 252 CPSCPAPEFLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVH 311

Qy  67 NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126
       ||||||||||:|||||||||||||||||||||||||||||| ||: ||||||||||||||
Db 312 NAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPRE 371

Qy 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186
       |||||||||::|:|||||||||||||||||||||||||||||||||||||||||||||||
Db 372 PQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 431

Qy 187 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||:|||||||||:||||||||||||||||||||||||| ||
Db 432 LYSRLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLGK 473



RESULT 5 

Q8NF17 

ID Q8NF17 PRELIMINARY; PRT; 509 AA.
AC Q8NF17;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE FLJ00385 protein (Fragment).
GN FLJ00385.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Spleen;
RA Jikuya H., Takano J., Kikuno R., Nagase T., Ohara O.;
RT "The nucleotide sequence of a long cDNA clone isolated from human
RT spleen.";
RL Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AK090464; BAC03445.1;
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003597; Ig_c1.
DR InterPro; IPR003006; Ig_MHC.
DR Pfam; PF00047; ig; 3.
DR SMART; SM00407; IGc1; 3.
DR PROSITE; PS50835; IG_LIKE; 3.
DR PROSITE; PS00290; IG_MHC; 2.
FT NON_TER 1 1
SQ SEQUENCE 509 AA; 56111 MW; 089498D8076E863C CRC64;

Query Match 91.4%; Score 1131; DB 4; Length 509;
Best Local Similarity 92.0%; Pred. No. 1.9e-98;
Matches 207; Conservative 9; Mismatches 9; Indels 0; Gaps 0;

Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       |    || |||||||||||||||||||||||||||||||||||||||||||||:| ||||
Db 214 DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFKWYVD 273

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       |||||||||||||||:|||:|||||||||||||||||||||||||||||||||||||| |
Db 274 GVEVHNAKTKPREEQFNSTFRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTK 333

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       |||||||||||||||:|:|||||||||||||||||||||||||:||||||| ||||:|||
Db 334 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 393

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 226
       |||||||||||||||||||||:|||||||||||| :|||||||||
Db 394 DGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSP 438

RESULT 6 
Q95M34 

ID Q95M34 PRELIMINARY; PRT; 337 AA. 

AC Q95M34; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Immunoglobulin gamma 1 heavy chain constant region (Fragment).
GN IGHC1.
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796;
RN [1]
RP SEQUENCE FROM N.A.
RA Wagner B.;
RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RX MEDLINE=98383416; PubMed=9717671;
RA Wagner B., Overesch G., Sheoran A., Holmes M., Richards C.,
RA Leibold W., Radbruch A.;
RT "Organization of the equine immunoglobulin heavy chain constant region
RT genes. III. Alignment of c-mu, c-gamma, c-epsilon and c-alpha genes.";
RL Immunobiology 199:105-119(1998).
DR EMBL; AJ300675; CAC44624.1; -.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003597; Ig_c1.
DR InterPro; IPR003006; Ig_MHC.
DR Pfam; PF00047; ig; 2.
DR SMART; SM00407; IGc1; 2.
DR PROSITE; PS50835; IG_LIKE; 3.
DR PROSITE; PS00290; IG_MHC; 2.
FT NON_TER 1 1
SQ SEQUENCE 337 AA; 37438 MW; A60BF2B01DEFD1F6 CRC64;

Query Match 73.4%; Score 909; DB 6; Length 337;
Best Local Similarity 71.9%; Pred. No. 1.1e-77;

Matches 161; Conservative 32; Mismatches 29; Indels 2; Gaps 1; 
Qy 7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVH 66 

II llllllllllllhlll ; ! i i , i I : 1 1 hhlllllhlllll 

Db 114 CPKCPAPELLGGPSVFIFPPNPKDTLMITRTPEVTCWVDVSQENPDVKFNWYMDGVEVR 173 

Qy 67 NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPRE 126 

I hhllhllllllllll : llllhllhlllhhlll Mhlhl I I 
Db 174 TATTRPKEEQFNSTYRWSVLRIQHQDWLSGKEFKCKVNNQALPQPIERTITKTKGRSQE 233 

Qy 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP - - ENNYKTTPPVLDSDGS 184 

I I I I I I MhhMiMMM Ml H MIMIIII I I II Mill 
Db 234 PQVYVLAPHPDELSKSKVSVTCLVKDFYPPEINIEWQSNGQPELETKYSTTQAQQDSDGS 2 93 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

ii : | . - i mi MMMiiiiiihM mm 

Db 2 94 YFLYS KLS VDRNRWQQGTTFTCGVMHEALHNHYTQKNVS KNPGK 337 

RESULT 7 
Q8R3V9 

ID Q8R3V9 PRELIMINARY; PRT; 469 AA. 

AC Q8R3V9; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created)
DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Hypothetical 52.0 kDa protein.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC024405; AAH24405.1; -.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 4.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 1.
KW Hypothetical protein.
SQ SEQUENCE 469 AA; 51976 MW; 534793F155D05457 CRC64;

Query Match 67.2%; Score 831.5; DB 11; Length 469; 

Best Local Similarity 63.8%; Pred. No. 3.5e-70; 

Matches 143; Conservative 41; Mismatches 35; Indels 5; Gaps 2; 

Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVE 64 

II I I I : Ilhlllllll I h Ihllllllhl ih: i I || 

Db 249 CKPCICTVPEV SSVF I FPPKPKDVLTI TLTPKVTCVWDI SKDDPEVQFSWFVDDVE 305 

Qy 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124 

II hllllllhllhl II I h , I; :!l , I llllllllll Ihl 

Db 306 VHTAQTKPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 365 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

: Mllhll :::: U hi II Mh I I I I III I l-hlll 

Db 366 KAPQVYT I P P PKEQMAKDKVSLTCM I TDFF P ED I TVEWQWNGQPAEN YKNTQP I MDTDGS 425 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

II I II h II hllhll lllhhilll I I I I 
Db 426 YFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKSLSHSPGK 469 



RESULT 8 
Q99LC4 

ID Q99LC4 PRELIMINARY; PRT; 463 AA. 

AC Q99LC4; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to RIKEN cDNA 1810060O09 gene.
GN IGH-4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC003435; AAH03435.1; -.
DR HSSP; P01842; 7FAB.
DR MGD; MGI:96446; Igh-4.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 4.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 1.
SQ SEQUENCE 463 AA; 51007 MW; EAA674C6BBC30783 CRC64;

Query Match 66.9%; Score 828.5; DB 11; Length 463; 

Best Local Similarity 63.4%; Pred. No. 6.6e-70; 

Matches 142; Conservative 42; Mismatches 35; Indels 5; Gaps 2; 

Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVE 64 

I" Ilhlllllll I h Ihllllllhl :|||hhhll II 

Db 243 CKPCICTVPEV S S VF I F P P KP KD VLT I TLT P KVTC VWD I S KDD P E VQFS W F VDD VE 2 99 

Qy 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

I I hhllllhllhl II I ■ !l i -I I- | I.I ||:| 

Db 300 VHTAQTQPREEQFNSTFRS VSELP I MHQDWLNGKEFKCRVNSAAF PAP I EKTI SKTKGRP 359 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

: I = = = : I ^ I I I I I - hi II Mh I I I I III I l-hlll 

Db 360 KAPQVYTI PPPKEQMAKDKVSLTCMI TDFFPEDI TVEWQWNGQPAENYKNTQP I MDTDGS 419 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I I II h II hllhll lllhhilll I I I I 

Db 42 0 YF I YS KLNVQKSNWEAGNTFTCS VLHEGLHNHHTEKSLSHS PGK 4 63 



RESULT 9 
Q9R1A4 

ID Q9R1A4 PRELIMINARY; PRT; 437 AA. 

AC Q9R1A4;
DT 01-MAY-2000 (TrEMBLrel. 13, Created)
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Gamma 1 heavy chain of Mab7 (Fragment).
GN IGH-4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Wilde K.G., Yu X., Ekramoddoullah A.K.M., Misra S.;
RT "Cloning of cDNAs encoding for anti-white pine blister rust monoclonal
RT antibody (Mab 7, its light and heavy chains) and construction of a
RT single chain antibody (scFV).";
RL Submitted (MAY-1999) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF152372; AAD40243.1; -.
DR HSSP; P01842; 7FAB.
DR MGD; MGI:96446; Igh-4.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 4.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 1.
FT NON_TER 1 1
FT NON_TER 437 437
SQ SEQUENCE 437 AA; 48142 MW; 5C3A7BB3EE7D697C CRC64;



Query Match 66.6%; Score 824.5; DB 11; Length 437; 

Best Local Similarity 62.9%; Pred. No. 1.5e-69; 

Matches 141; Conservative 43; Mismatches 35; Indels 5; Gaps 2; 

Qy 7 CPP- -CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVE 64 

II I I I 5 llhlllllll I h Ihllllllhl llhl ,:| || 

Db 217 CKPCICTVPEV SSVFI FPPKPKDVLTITLTPKVTCVWDISKDDPEVQFSWFVDDVE 273 

Qy 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124 

II hhllllhllhl II I i I I I ! : : | IMMIIIM ||:| 
Db 274 VHTAQTQPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 333 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

: MM ■■■■■■■ h:|lllh: hi II llh I I I I III I l-hlll 

Db 334 KAPQVYTIPPPKEQMAKDKVSLTCMITDFFPEDITVEWQWNGQPAENYKNTQPIMDTDGS 393 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

:hllll I II I II hllhll lllhhhil I I I I 
Db 394 YFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKNLSHSPGK 437 



RESULT 10 
Q9D8L4 

ID Q9D8L4 PRELIMINARY; PRT; 473 AA. 

AC Q9D8L4; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE 1810060O09Rik protein. 

GN IGH-1 OR 1810060O09RIK. 

OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=C57BL/6J; TISSUE=Pancreas;
RX MEDLINE=21085660; PubMed=11217851;
RA Kawai J., Shinagawa A., Shibata K., Yoshino M., Itoh M., Ishii Y.,
RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S.,
RA Aizawa K., Izawa M., Nishi K., Kiyosawa H., Kondo S., Yamanaka I.,
RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R.,
RA Kadota K., Matsuda H.A., Ashburner M., Batalov S., Casavant T.,
RA Fleischmann W., Gaasterland T., Gissi C., King B., Kochiwa H.,
RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G., Quackenbush J.,
RA Schriml L.M., Staubli F., Suzuki R., Tomita M., Wagner L., Washio T.,
RA Sakai K., Okido T., Furuno M., Aono H., Baldarelli R., Barsh G.,
RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F.,
RA Brownstein M.J., Bult C., Fletcher C., Fujita M., Gariboldi M.,
RA Gustincich S., Hill D., Hofmann M., Hume D.A., Kamiya M., Lee N.H.,
RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P.,
RA Nordone P., Ring B., Ringwald M., Rodriguez I., Sakamoto N.,
RA Sasaki H., Sato K., Schoenbach C., Seya T., Shibata Y., Storch K.-F.,
RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C., Whittaker C., Wilming L.,
RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S.,
RA Hayashizaki Y.;
RT "Functional annotation of a full-length mouse cDNA collection.";
RL Nature 409:685-690(2001).
DR EMBL; AK007918; BAB25349.1;
DR HSSP; P01842; 7FAB.
DR MGD; MGI:96443; Igh-1.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 4.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 1.
SQ SEQUENCE 473 AA; 51699 MW; 9DED57A514475FBB CRC64;

Query Match 65.5%; Score 811; DB 11; Length 473; 

Best Local Similarity 64.0%; Pred. No. 3.1e-68; 

Matches 142; Conservative 38; Mismatches 42; Indels 0; Gaps 0; 

Qy 7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVH 66 

I I I I Ihlllllllhllll II I I I I H MINI :|hh :|:|: II 
Db 252 CPPCAAPDLLGGPSVFIFPPKIKDVLMISLSPMVTCVVVDVSEDDPDVQISWFVNNVEVH 311 

Qy 67 NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPRE 12 6 

hh | | | | | | | | | | : | J | | ^ : | | | : j 1 | | : | : j | j : | | | I 1 I [ :| 
Db 312 TAQTQTHREDYNSTLRWSALPIQHQDWMSGKEFKCKVNNRALPSPI EKTI SKPRGPVRA 371 

Qy 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186 

I I I I III 'Ml = Ml II h = llhl I I III I llllllhl 

Db 372 PQVYVLPPPAEEMTKKEFSLTCMITGFLPAEIAVDWTSNGRTEQNYKNTATVLDSDGSYF 431 

Qy 187 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

i I II |::|::hll|:|| i I |::| | I 
Db 432 MYSKLRVQKSTWERGSLFACSWHEGLHNHLTTKTISRSLGK 4 73 



RESULT 11 
Q99L31

ID Q99L31 PRELIMINARY; PRT; 468 AA. 

AC Q99L31; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to RIKEN cDNA 1810060O09 gene.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC003878; AAH03878.1; -.
DR HSSP; P01842; 7FAB.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 3.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 1.
SQ SEQUENCE 468 AA; 51661 MW; 96352328B3332ADB CRC64;

Query Match 65.0%; Score 805; DB 11; Length 468;
Best Local Similarity 65.2%; Pred. No. 1.1e-67;

Matches 146; Conservative 30; Mismatches 46; Indels 2; Gaps 1; 

Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVE 64 

III I I I I lllllllhllll II I I I I I MIIIIIM : I I : I : :|:|: || 
Db 245 CPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPMVTCVWDVSEDDPDVQISWFVNNVE 304 

Qy 65 VHNAKTKPREEQYNSTYRWSVXiTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

I hh I I I I I I I I I I : MI|::|||:||lh!IIIMIIhllll II 
Db 305 VIjTAQTQTHREDYNSTLRVVSALPI QHQDWMSGKEFKCKVNNKALPAPI ERTI SKPKGSV 364 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

I I I I I III :hll Ihllhl I I II III I I III I llllllll 

Db 365 RAPQWVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGS 424 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I hi I : | :||||:|| lllhl || | III 
Db 425 YFMYSKLRVEKKNWVERNSYSCSWHEGLHNHHTTKSFSRTPGK 468 



RESULT 12 
Q99L25 

ID Q99L25 PRELIMINARY; PRT; 473 AA. 

AC Q99L25; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created)
DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Similar to RIKEN cDNA 1810060O09 gene.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC003888; AAH03888.1; -.
DR HSSP; P01842; 7FAB.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 3.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 1.
SQ SEQUENCE 473 AA; 52449 MW; BE9889B7986DA155 CRC64;

Query Match 65.0%; Score 805; DB 11; Length 473;
Best Local Similarity 65.2%; Pred. No. 1.1e-67;

Matches 146; Conservative 30; Mismatches 46; Indels 2; Gaps 1; 
Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVE 64 

III 1 1 1 1 lllllllhllll II Mil I lllllllll M l MM: || 

Db 250 CPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPMVTCWVDVSEDDPDVQISWFVNNVE 309 

Qy 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

I hh I 1 1 1 1 1 1 1 1 I : lllhHIhlllhllllllllhllll II 

Db 310 VLTAQTQTHREDYNSTLRVVSALPI QHQDWMSGKEFKCKVNNKALPAPI ERTI SKPKGSV 369 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

I I I I I Ml :hl| Ihllhl I I II Ml I | Ml | IIIMIM 
Db 370 RAPQWVLPPPEEEMTKKQVTLTCMVTDFMPEDI YVEWTNNGKTELNYKNTEPVLDSDGS 42 9 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

= 1=1111 hi I : I :||lhll lllhl II I UN 
Db 43 0 YFMYSKLRVEKKNWVERNSYSCSWHEGLHNHHTTKSFSRTPGK 473 



RESULT 13 
Q91Z05 

ID Q91Z05 PRELIMINARY; PRT; 473 AA. 

AC Q91Z05; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Hypothetical 51.9 kDa protein.
GN AU044919.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC010327; AAH10327.1; -.
DR MGD; MGI:2144967; AU044919.
DR InterPro; IPR000345; CytC_heme_bind.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003006; Ig_MHC.
DR InterPro; IPR003596; Ig_v.
DR Pfam; PF00047; ig; 3.
DR SMART; SM00406; IGv; 1.
DR PROSITE; PS00190; CYTOCHROME_C; 1.
DR PROSITE; PS50835; IG_LIKE; 4.
DR PROSITE; PS00290; IG_MHC; 1.
KW Hypothetical protein.
SQ SEQUENCE 473 AA; 51946 MW; CF625F008932AF12 CRC64;

Query Match 63.0%; Score 780; DB 11; Length 473;
Best Local Similarity 61.8%; Pred. No. 2.6e-65;
Matches 141; Conservative 34; Mismatches 47; Indels 6; Gaps 1;



Qy 7 CPP CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 

Ml I I I I I lllllhlll II I I I I IhllMIIIII :|hh ! 

Db 246 CPPCKECHKCPAPNLEGGPSVFIFPPNIKDVLMISLTPKVTCVVVDVSEDDPDVQISWFV 305 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

s I I I I hi: I I I I I I I I I I : ■ ^i. Ihllhllll 

Db 306 NNVEVHTAQTQTHREDYNSTIRWSALPIQHQDWMSGKEFKCKVNNKDLPSPIERTISKI 365 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

II I I I I I III ::h: lllllll II I Ihlll III I III I |||| 
Db 366 KGLVRAPQVYILPPPAEQLSRKDVSLTCLWGFNPGDISVEWTSNGHTEENYKDTAPVLD 425 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 22 8 

Mlhhllll : l = l = : = Ilhl II I hi =h:| I I I I 
Db 426 SDGSYFI YSKLDIKTSKWEKTDSFSCNWHEGLKNYYLKKTISRSPGK 473 



RESULT 14 
Q8R3H6 

ID Q8R3H6 PRELIMINARY; PRT; 474 AA. 

AC Q8R3H6; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created)

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical 51.7 kDa protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A.

RA Strausberg R.;

RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC025447; AAH25447.1; -. 

DR InterPro; IPR000345; CytC_heme_bind . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS00190; CYTOCHROME_C ; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 474 AA; 51748 MW; 8608B57C6CD2874A CRC64;

Query Match 63.0%; Score 780; DB 11; Length 474; 

Best Local Similarity 61.8%; Pred. No. 2.6e-65; 

Matches 141; Conservative 34; Mismatches 47; Indels 6; Gaps 1; 

Qy 7 CPP CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

Ml I I I I I lllllhlll II I I I I IhllMIIIII : I I : I : 

Db 247 CPPCKECHKCPAPNLEGGPSVFIFPPNIKDVLMISLTPKVTCVWDVSEDDPDVQISWFV 306 



Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 



: I I I I hh I I I I I I I I I I : lll|::||hlMhll Ihllhllll 
Db 307 NNVEVHTAQTQTHREDYNSTI RWSALP I QHQDWMSGKEFKCKVNNKDLPSPI ERTI SKI 366 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

M I M I I III ::h: lllllll II I Ihlll III I III I II 
Db 367 KGLVRAPQW I LP PPAEQLSRKDVSLTCLWGFNPGDIS VEWTSNGHTEENYKDTAPVLD 426 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

lllhhllll : hh = : Ilhl II I hi :|::| I I I I 
Db 427 SDGSYFI YSKLDIKTSKWEKTDSFSCNVRHEGLKNYYLKKTISRSPGK 474 
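
The database entries quoted in this listing prefix each line with a two-letter code (ID, AC, DT, DE, OS, DR, and so on). As a minimal sketch, assuming an entry is available as a list of text lines, the lines can be grouped by that code. The function name `group_by_line_code` is hypothetical and is not part of GenCore or of any database toolkit.

```python
from collections import defaultdict


def group_by_line_code(lines):
    """Group flat-file lines by their leading two-letter code.

    Lines that do not start with a two-letter upper-case code
    (for example alignment rows such as "Qy" or "Db") are skipped.
    """
    fields = defaultdict(list)
    for line in lines:
        line = line.rstrip()
        if len(line) < 2:
            continue  # blank or too short to carry a line code
        code, _, rest = line.partition(" ")
        if len(code) == 2 and code.isupper():
            fields[code].append(rest.strip())
    return dict(fields)


# Sample lines copied from the Q8R3H6 entry in this listing.
entry = [
    "ID Q8R3H6 PRELIMINARY; PRT; 474 AA.",
    "AC Q8R3H6;",
    "DE Hypothetical 51.7 kDa protein.",
    "OS Mus musculus (Mouse).",
    "DR InterPro; IPR007110; Ig-like.",
    "DR InterPro; IPR003006; Ig_MHC.",
    "",
    "Qy 7 CPP",
]
grouped = group_by_line_code(entry)
print(grouped["AC"], len(grouped["DR"]))
```

Keeping repeated codes such as DR in a list preserves their order, which matters because one entry usually carries several cross-references.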

RESULT 15 
Q9BSZ1 

ID Q9BSZ1 PRELIMINARY; PRT; 375 AA. 

AC Q9BSZ1; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created)

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph; 

RA Strausberg R.; 

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC004476; AAH04476.1; -.

DR HSSP; P01857; 1FC1.

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 3. 

KW Hypothetical protein. 

SQ SEQUENCE 375 AA; 41314 MW; B1A0A0998F473619 CRC64;

Query Match 28.9%; Score 358; DB 4; Length 375; 

Best Local Similarity 34.1%; Pred. No. 1.5e-25; 

Matches 73; Conservative 48; Mismatches 85; Indels 8; Gaps 6; 
Qy 21 VFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNST 80 



" " I I 11*11"* I I * I I I I I : I 

Db 146 VFAI PPS - FAS I FLTKSTKLTCLVTDLTT YD - S VTI SWTRQNGEAVKTHTN I S ESHPNAT 203 

Qy 81 YRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP -REPQVYTLPPSRDEL 139 

' I =: M =1= = I h= Ihh-llh II I II |||:|::| 

Db 204 FSAVGEASICEDDWNSGERFTCTVTHTDLPSPLKQTISRPKGVALHRPDVYLLPPAREQL 263 

Qy 140 T-KNQVSLTCLVKGFYPSDIAVEWESNGQP--ENNYKTTPPVLD--SDGSFFLYSKLTVD 194

= MM || hh hi III I h h : : | : | : | Ml 

Db 264 NLRESATITCLVTGFSPADVFVQWMQRGQPLSPEKYVTSAPMPEPQAPGRYFAHSILTVS 323 



Qy 195 KSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

: I I ::| I I I I I I I II 

Db 324 EEEWNTGETYTCWAHEALPNRVTERTVDKSTGK 357 
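
The "Best Local Similarity" figures printed in this listing are consistent with the number of matches divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). This is an observation about the printed numbers, not a documented GenCore definition. A minimal sketch under that assumption, with a hypothetical function name:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Return matches as a percentage of all alignment columns, to one decimal.

    Assumes the denominator is the sum of the four counts printed on the
    "Matches ...; Conservative ...; Mismatches ...; Indels ...;" line.
    """
    columns = matches + conservative + mismatches + indels
    if columns <= 0:
        raise ValueError("alignment must contain at least one column")
    return round(100.0 * matches / columns, 1)


# Counts printed for Q9BSZ1 (Result 15) and Q8R3H6 (Result 14).
print(best_local_similarity(73, 48, 85, 8))    # listing reports 34.1%
print(best_local_similarity(141, 34, 47, 6))   # listing reports 61.8%
```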



Search completed: January 16, 2004, 09:38:45 
Job time : 38 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 16, 2004, 09:36:26 ; Search time 17 Seconds
(without alignments)
630.711 Million cell updates/sec

Title: US-09-843-221A-2
Perfect score: 1238
Sequence: 1 MDKTHTCPPCPAPELLGGPS MHEALHNHYTQKSLSLSPGK 228

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 127863 seqs, 47026705 residues

Total number of hits satisfying chosen parameters: 127863

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : SwissProt 41:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
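
Alongside Pred. No., each summary row reports a Query Match percentage. The values in this report are consistent with the hit score divided by the perfect score (1238 for this query). The sketch below reproduces that arithmetic; the relation is inferred from the printed figures rather than taken from GenCore documentation, and the function name is hypothetical.

```python
def query_match_percent(score, perfect_score):
    """Return score as a percentage of the perfect score, to one decimal."""
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


PERFECT_SCORE = 1238  # "Perfect score" from the report header

# Scores of the first four SwissProt hits in this report.
for score in (1233, 1142.5, 1135, 1126):
    print(score, query_match_percent(score, PERFECT_SCORE))
```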



SUMMARIES 



Result            Query
   No.    Score   Match  Length  DB  ID           Description
     1     1233    99.6     330   1  GC1_HUMAN    P01857 homo sapien
     2   1142.5    92.3     326   1  GC2_HUMAN    P01859 homo sapien
     3     1135    91.7     327   1  GC4_HUMAN    P01861 homo sapien
     4     1126    91.0     290   1  GC3_HUMAN    P01860 homo sapien
     5    918.5    74.2     323   1  GC_RABIT     P01870 oryctolagus
     6      889    71.8     329   1  GC2_CAVPO    P01862 cavia porce
     7    845.5    68.3     329   1  GC3_MOUSE    P22436 mus musculu
     8      838    67.7     333   1  GCB_RAT      P20761 rattus norv
     9    834.5    67.4     398   1  GC3M_MOUSE   P03987 mus musculu
    10    818.5    66.1     326   1  GC1_RAT      P20759 rattus norv
    11    817.5    66.0     324   1  GC1_MOUSE    P01868 mus musculu
    12    812.5    65.6     393   1  GC1M_MOUSE   P01869 mus musculu
    13    809.5    65.4     329   1  GCC_RAT      P20762 rattus norv
    14      809    65.3     330   1  GCAA_MOUSE   P01863 mus musculu
    15      804    64.9     399   1  GCAM_MOUSE   P01865 mus musculu
    16      802    64.8     335   1  GCAB_MOUSE   P01864 mus musculu
    17    785.5    63.4     322   1  GCA_RAT      P20760 rattus norv
    18      779    62.9     336   1  GCB_MOUSE    [illegible] mus musculu
    19      774    62.5     405   1  GCBM_MOUSE   [illegible] mus musculu
    20      358    28.9     454   1  MUC_HUMAN    P01871
    21      357    28.8     455   1  MUC_MOUSE    P01872 mus musculu
    22      356    28.8     391   1  MUCB_HUMAN   [illegible] homo sapien
    23      354    28.6     421   1  EPC_MOUSE    [illegible] mus musculu
    24      353    28.5     429   1  EPC_RAT      [illegible] rattus norv
    25      352    28.4     428   1  EPC_HUMAN    [illegible] homo sapien
    26      347    28.0     476   1  MUCM_MOUSE   [illegible] mus musculu
    27      342    27.6     458   1  MUC_RABIT    P03988 oryctolagus
    28      332    26.8     450   1  MUC_CANFA    [illegible]
    29      332    26.8     454   1  MUC_MESAU    P06337 [illegible]
    30      332    26.8     479   1  MUCM_RABIT   P04221 oryctolagus
    31      329    26.6     457   1  MUC_SUNMU    [illegible]
    32      297    24.0     438   1  HVC2_HETFR   [illegible]
    33    295.5    23.9     299   1  ALC_RABIT    [illegible] oryctolagus
    34    287.5    23.2     438   1  HVCS_HETFR   [illegible]
    35    287.5    23.2     446   1  MUC_CHICK    [illegible]
    36    278.5    22.5     461   1  HVCM_HETFR   [illegible]
    37    276.5    22.3     393   1  HVC3_HETFR   [illegible]
    38      272    22.0     353   1  ALC1_HUMAN   [illegible] homo sapien
    39      272    22.0     370   1  HVC1_HETFR   [illegible]
    40    267.5    21.6     353   1  ALC1_GORGO   P20758 gorilla gor
    41      267    21.6     340   1  ALC2_HUMAN   P01877 homo sapien
    42    248.5    20.1     481   1  MUCM_ICTPU   P23735 ictalurus p
    43      245    19.8     344   1  ALC_MOUSE    P01878 mus musculu
    44      179    14.5     513   1  SHS1_MOUSE   P97797 m protein-t
    45    177.5    14.3     105   1  LAC1_MOUSE   P01843 mus musculu



ALIGNMENTS 



RESULT 1 
GC1_HUMAN 

ID GC1_HUMAN STANDARD; PRT; 330 AA.

AC P01857;

DT 21-JUL-1986 (Rel. 01, Created)

DT 21-JUL-1986 (Rel. 01, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Ig gamma-1 chain C region.

GN IGHG1.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=82274238; PubMed=6287432;

RA Ellison J.W., Berson B.J., Hood L.E.;

RT "The nucleotide sequence of a human immunoglobulin C gamma1 gene.";

RL Nucleic Acids Res. 10:4071-4079(1982).

RN [2] 

RP SEQUENCE OF 1-135 (MYELOMA PROTEIN EU).

RX MEDLINE=71064024; PubMed=5489771;

RA Cunningham B.A., Rutishauser U., Gall W.E., Gottlieb P.D.,

RA Waxdal M.J., Edelman G.M.;

RT "The covalent structure of a human gamma G-immunoglobulin. VII. Amino

RT acid sequence of heavy-chain cyanogen bromide fragments H1-H4.";

RL Biochemistry 9:3161-3170(1970).

RN [3]

RP SEQUENCE OF 136-329 (EU).

RX MEDLINE=71064025; PubMed=5530842;

RA Rutishauser U., Cunningham B.A., Bennett C., Konigsberg W.H.,

RA Edelman G.M.;

RT "The covalent structure of a human gamma G-immunoglobulin. 8. Amino

RT acid sequence of heavy-chain cyanogen bromide fragments H5-H7.";

RL Biochemistry 9:3171-3181(1970).

RN [4]

RP SEQUENCE (MYELOMA PROTEIN NIE).

RX MEDLINE=77070269; PubMed=826475;

RA Ponstingl H., Hilschmann N.;

RT "The rule of antibody structure. The primary structure of a

RT monoclonal IgG1 immunoglobulin (myeloma protein Nie). III. The

RT chymotryptic peptides of the H-chain, alignment of the tryptic

RT peptides and discussion of the complete structure.";

RL Hoppe-Seyler's Z. Physiol. Chem. 357:1571-1604(1976).

RN [5]

RP SEQUENCE (MYELOMA PROTEIN KOL), AND DISULFIDE BONDS.

RX MEDLINE=83289131; PubMed=6884994;

RA Schmidt W.E., Jung H.-D., Palm W., Hilschmann N.;

RT "Three-dimensional structure determination of antibodies. Primary

RT structure of crystallized monoclonal immunoglobulin IgG1 KOL, I.";

RL Hoppe-Seyler's Z. Physiol. Chem. 364:713-747(1983).

RN [6] 

RP DISULFIDE BONDS. 

RX MEDLINE=71064027; PubMed=4923144;

RA Gall W.E., Edelman G.M.;

RT "The covalent structure of a human gamma G-immunoglobulin. X.

RT Intrachain disulfide bonds.";

RL Biochemistry 9:3188-3196(1970).

RN [7]

RP DISULFIDE BONDS.

RX MEDLINE=77070267; PubMed=1002129;

RA Dreker L., Schwarz J., Reichel W., Hilschmann N.;

RT "Rule of antibody structure. The primary structure of a monoclonal

RT IgG1 immunoglobulin (myeloma protein Nie), I: Purification and

RT characterization of the protein, the L- and H-chains, the

RT cyanogen bromide cleavage products, and the disulfide bridges.";

RL Hoppe-Seyler's Z. Physiol. Chem. 357:1515-1540(1976).

RN [8]

RP X-RAY CRYSTALLOGRAPHY (2.9 ANGSTROMS).

RX MEDLINE=81208100; PubMed=7236608;

RA Deisenhofer J.;

RT "Crystallographic refinement and atomic models of a human Fc fragment

RT and its complex with fragment B of protein A from Staphylococcus

RT aureus at 2.9- and 2.8-A resolution.";

RL Biochemistry 20:2361-2370(1981). 

CC -!- MISCELLANEOUS: NIE HAS THE G1M(17) ALLOTYPIC MARKER, 97-K, & THE

CC G1M(1) MARKERS, 239-D & 241-L. KOL & EU SEQUENCES HAVE THE G1M(3)

CC MARKER & THE G1M(NON-1) MARKERS.

CC -!- MISCELLANEOUS: NIE ALSO DIFFERS IN THE AMIDATION STATES OF

CC 35,116,198,269 & 272.

CC -!- MISCELLANEOUS: EU ALSO DIFFERS IN THE AMIDATION STATES OF RESIDUES

CC 155, 166, 177, 195, 198, 269, AND 272 AND IN THE ORDER OF RESIDUES

CC 268-272.

CC -!- MISCELLANEOUS: KOL ALSO DIFFERS IN THE AMIDATION STATES OF

CC RESIDUES 198,267&272.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR
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DR 


PDB 
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DR 


PDB 
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DR 


PDB 


* 1AJ7 
DR PDB; 1D5B; 09-FEB-00.

DR PDB; 1D5I; 09-FEB-00.

DR PDB; 1D6V; 04-OCT-00.

DR PDB; 1DN2; 17-MAY-00.

DR PDB; 1E4K; 06-JUN-01.

DR PDB; 1FCC; 20-JUL-95.

DR PDB; 1HZH; 12-JUN-02.

DR PDB; 1I7Z; 08-AUG-01.

DR PDB; 1IIS; 16-MAY-01.

DR PDB; 1IIX; 16-MAY-01.

DR PDB; 1L6X; 10-APR-02.

DR PDB; 2RCS; 12-NOV-97.

DR Genew; HGNC:5525; IGHG1.

DR MIM; 147100; -.

DR GO; GO:0005624; C:membrane fraction; NAS.

DR GO; GO:0003823; F:antigen binding activity; TAS.

DR GO; GO:0006955; P:immune response; NAS.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_cl.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 3.

DR SMART; SM00407; IGcl; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 2.

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein;

KW 3D-structure.


FT 
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1 1 


FT 


DOMAIN 


1 98 CHI. 


FT 


DOMAIN 


99 110 HINGE. 


FT 


DOMAIN 
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FT 


DOMAIN 


224 330 CH3 . 


FT 


DISULFID 


27 83 


FT 


DISULFID 


103 103 INTERCHAIN (WITH LIGHT CHAIN) . 
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DISULFID 
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DISULFID 
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DISULFID 


144 204 


FT 


DISULFID 


250 308 


FT 
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M0D_ 


RES 


330 330 REMOVED POST- TRANS LAT I ONALLY . 
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TURN 
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FT 


STRAND 
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FT 


TURN 


313 


314 


FT 


TURN 


316 


317 


FT 


STRAND 


320 
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SQ 


SEQUENCE 


330 AA; 


36 



K -> R (IN G1M(3) MARKER).

/FTId=VAR_003886.

D -> E (IN G1M(NON-1) MARKER).

/FTId=VAR_003887.

L -> M (IN G1M(NON-1) MARKER).

/FTId=VAR_003888.



MW; 3770EE106C2FA33D CRC64 ; 



Query Match 99.6%; Score 1233; DB 1; Length 330; 

Best Local Similarity 100.0%; Pred. No. 2.2e-93; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 104 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 163

Qy  62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 164 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 223

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 224 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 283

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||||||||||||||||||||||||||||||||||||||||||||||
Db 284 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 330

RESULT 2 
GC2_HUMAN 

ID GC2_HUMAN STANDARD; PRT; 326 AA. 

AC P01859; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Ig gamma-2 chain C region.

GN IGHG2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE OF 2-326 FROM N.A. 

RX MEDLINE=82197621; PubMed=6804948;

RA Ellison J.W., Hood L.E.;

RT "Linkage and sequence homology of two human immunoglobulin gamma

RT heavy chain constant region genes.";

RL Proc. Natl. Acad. Sci. U.S.A. 79:1984-1988(1982).

RN [2]

RP SEQUENCE OF 88-115 FROM N.A.

RC TISSUE=Fetal liver;

RX MEDLINE=83001943; PubMed=6811139;

RA Takahashi N., Ueda S., Obata M., Nikaido T., Nakai S., Honjo T.;

RT "Structure of human immunoglobulin gamma genes: implications for

RT evolution of a gene family.";

RL Cell 29:671-679(1982).

RN [3]

RP SEQUENCE OF 99-177 AND 310-326 FROM N.A.

RC TISSUE=Fetal liver;

RX MEDLINE=84235992; PubMed=6329676;

RA Krawinkel U., Rabbitts T.H.;

RT "Comparison of the hinge-coding segments in human immunoglobulin gamma

RT heavy chain genes and the linkage of the gamma 2 and gamma 4 subclass

RT genes.";

RL EMBO J. 1:403-407(1982).

RN [4]

RP SEQUENCE OF 1-325 (MYELOMA PROTEIN TIL).

RX MEDLINE=81007873; PubMed=6774012;

RA Wang A.-C., Tung E., Fudenberg H.H.;

RT "The primary structure of a human IgG2 heavy chain: genetic,

RT evolutionary, and functional implications.";

RL J. Immunol. 125:1048-1054(1980).

RN [5]

RP SEQUENCE OF 1-85 AND 132-325 (MYELOMA PROTEIN ZIE).

RX MEDLINE=80001357; PubMed=113060;

RA Connell G.E., Parr D.M., Hofmann T.;

RT "The amino acid sequences of the three heavy chain constant region

RT domains of a human IgG2 myeloma protein.";

RL Can. J. Biochem. 57:758-767(1979).

RN [6]

RP SEQUENCE OF 238-275 (ZIE).

RX MEDLINE=80114419; PubMed=118920;

RA Hofmann T., Parr D.M.;

RT "A note of the amino acid sequence of residues 381-391 of human

RT immunoglobulins gamma chains.";

RL Mol. Immunol. 16:923-925(1979).

RN [7]

RP REVISIONS TO 25; 59; 60 AND 264-268 (ZIE).

RA Hofmann T., Parr D.M.;

RL Submitted (MAR-1980) to the PIR data bank.

RN [8]

RP SEQUENCE OF 1-121 (DOT).

RX MEDLINE=95255298; PubMed=7737190;

RA Stoppini M., Bellotti V., Negri A., Merlini G., Garver F., Ferri G.;

RT "Characterization of the two unique human anti-flavin monoclonal

RT immunoglobulins.";

RL Eur. J. Biochem. 228:886-893(1995).

RN [9]

RP DISULFIDE BONDS.

RX MEDLINE=72033500; PubMed=4940472;

RA Milstein C., Frangione B.;

RT "Disulphide bridges of the heavy chain of human immunoglobulin G2.";

RL Biochem. J. 121:217-225(1971). 

RN [10] 

RP DISULFIDE BONDS. 

RX MEDLINE=69064124; PubMed=5782707;

RA Frangione B., Milstein C., Pink J.R.L.;

RT "Structural studies of immunoglobulin G."; 

RL Nature 221:145-148(1969). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; J00230; AAB59393.1; -.

DR PIR; A93906; G2HU.

DR HSSP; P01857; 1FC1.

DR Genew; HGNC:5526; IGHG2.

DR MIM; 147110; -.

DR GO; GO:0005624; C:membrane fraction; NAS.

DR GO; GO:0003823; F:antigen binding activity; TAS.

DR GO; GO:0006955; P:immune response; NAS.

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 2. 

KW Immunoglobulin domain; Immunoglobulin C region. 

FT NON_TER 

FT DOMAIN 

FT DOMAIN 

FT DOMAIN 
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PT 
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1 U D 


INTERCHAIN (WITH A HEAVY CHAIN) . 




FT 


DT QTTT.PTD 


-L V J 




INTERCHAIN (WITH A HEAVY CHAIN) . 




PT 






9 n n 

z u u 






FT DISULFID 246 304

FT SITE 156 156 AT OR NEAR THE COMPLEMENT-BINDING SITE.

FT MOD_RES 326 326 REMOVED POST-TRANSLATIONALLY (PROBABLE)

FT VARIANT 60 60 S -> A (IN MYELOMA PROTEINS TIL & ZIE).

FT /FTId=VAR_003889.

FT CONFLICT 109 109 C -> S (IN REF. 3).

SQ SEQUENCE 326 AA; 35884 MW; 8310878C6878CF9C CRC64;





Query Match 92.3%; Score 1142.5; DB 1; Length 326;

Best Local Similarity 94.1%; Pred. No. 5e-86;

Matches 209; Conservative 8; Mismatches 4; Indels 1; Gaps 1;

Qy   7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66
       |||||||  : |||||||||||||||||||||||||||||||||||||:|||||||||||
Db 106 CPPCPAPP-VAGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFNWYVDGVEVH 164

Qy  67 NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126
       ||||||||||:|||:||||||||:||||||||||||||||| ||||||||||| ||||||
Db 165 NAKTKPREEQFNSTFRVVSVLTVVHQDWLNGKEYKCKVSNKGLPAPIEKTISKTKGQPRE 224

Qy 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186
       ||||||||||:|:||||||||||||||||||||||||||||||||||||||:||||||||
Db 225 PQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPMLDSDGSFF 284

Qy 187 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       ||||||||||||||||||||||||||||||||||||||||||
Db 285 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 326



RESULT 3 
GC4_HUMAN 

ID GC4_HUMAN STANDARD; PRT; 327 AA. 

AC P01861; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-4 chain C region.

GN IGHG4.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=83157104; PubMed=6299662;

RA Ellison J.W., Buxbaum J.N., Hood L.E.;

RT "Nucleotide sequence of a human immunoglobulin C gamma 4 gene.";

RL DNA 1:11-18(1981).

RN [2]

RP SEQUENCE OF 1-30 AND 81-326.

RX MEDLINE=70207560; PubMed=4192699;

RA Pink J.R.L., Buttery S.H., de Vries G.M., Milstein C.;

RT "Human immunoglobulin subclasses. Partial amino acid sequence of the

RT constant region of a gamma 4 chain.";

RL Biochem. J. 117:33-47(1970).

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).



DR EMBL; K01316; AAB59394.1; ALT_INIT.

DR PIR; [illegible]; G4HU.

DR PDB; 1ADQ; 16-SEP-98.

DR [illegible]

DR [illegible]

DR GO; GO:0005624; C:membrane fraction; NAS.

DR GO; GO:0003823; F:antigen binding activity; TAS.

DR GO; GO:0006955; P:immune response; NAS.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_cl.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 3.

DR SMART; SM00407; IGcl; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 2.

KW Immunoglobulin domain; Immunoglobulin C region; 3D-structure.

FT NON_TER 1 1

FT DOMAIN 1 98 CH1.

FT DOMAIN 99 110 HINGE.

FT DOMAIN 111 220 CH2.

FT DOMAIN 221 327 CH3.

FT DISULFID 14 14 INTERCHAIN (WITH A LIGHT CHAIN).

FT DISULFID 27 83

FT DISULFID 106 106 INTERCHAIN (WITH A HEAVY CHAIN).

FT DISULFID 109 109 INTERCHAIN (WITH A HEAVY CHAIN).

FT DISULFID 141 201

FT DISULFID 247 305

SQ SEQUENCE 327 AA; 35940 MW; 3EDBD811EF208E7A CRC64;

Query Match 91.7%; Score 1135; DB 1; Length 327;

Best Local Similarity 93.7%; Pred. No. 2.1e-85;

Matches 208; Conservative 8; Mismatches 6; Indels 0; Gaps 0;
Qy   7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66
       || ||||| ||||||||||||||||||||||||||||||||| |||||:|||||||||||
Db 106 CPSCPAPEFLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVH 165

Qy  67 NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126
       ||||||||||:|||||||||||||||||||||||||||||| ||: ||||||||||||||
Db 166 NAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPRE 225

Qy 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186
       |||||||||::|:|||||||||||||||||||||||||||||||||||||||||||||||
Db 226 PQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 285

Qy 187 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228
       |||:|||||||||:||||||||||||||||||||||||| ||
Db 286 LYSRLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLGK 327

RESULT 4 
GC3_HUMAN 

ID GC3_HUMAN STANDARD; PRT; 290 AA. 

AC P01860; 

DT 21-JUL-1986 (Rel. 01, Created)

DT 21-JUL-1986 (Rel. 01, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Ig gamma-3 chain C region (Heavy chain disease protein) (HDC).

GN IGHG3.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE (DISEASE PROTEIN WIS).

RX MEDLINE=81021548; PubMed=6774747;

RA Frangione B., Rosenwasser E., Prelli F., Franklin E.C.;

RT "Primary structure of human gamma 3 immunoglobulin deletion mutant:

RT gamma 3 heavy-chain disease protein Wis.";

RL Biochemistry 19:4304-4308(1980).

RN [2]

RP REVISIONS TO 12-97 (PROTEIN WIS).

RX MEDLINE=77118561; PubMed=402363;

RA Michaelsen T.E., Frangione B., Franklin E.C.;

RT "Primary structure of the 'hinge' region of human IgG3. Probable

RT quadruplication of a 15-amino acid residue basic unit.";

RL J. Biol. Chem. 252:883-889(1977).

RN [3]

RP REVISIONS TO 59-289 (PROTEIN WIS/DISEASE PROTEIN ZUC).

RX MEDLINE=77021516; PubMed=823945;

RA Wolfenstein-Todel C., Frangione B., Prelli F., Franklin E.C.;

RT "The amino acid sequence of 'heavy chain disease' protein ZUC.

RT Structure of the Fc fragment of immunoglobulin G3.";

RL Biochem. Biophys. Res. Commun. 71:907-914(1976).

RN [4]

RP SEQUENCE FROM N.A. (DISEASE PROTEIN OMM).

RX MEDLINE=82247835; PubMed=6808505;

RA Alexander A., Steinmetz M., Barritault D., Frangione B.,

RA Franklin E.C., Hood L., Buxbaum J.N.;

RT "Gamma Heavy chain disease in man: cDNA sequence supports partial

RT gene deletion model.";

RL Proc. Natl. Acad. Sci. U.S.A. 79:3260-3264(1982).

CC -!- SUBUNIT: DIMER LINKED BY 12 DISULFIDE BONDS; IT HAS AN EXTRA 
CC INTERCHAIN DISULFIDE BOND AT POSITION 7 IN ADDITION TO THE 11 

CC NORMALLY PRESENT IN THE HINGE REGION. 

CC -!- MISCELLANEOUS: THE HEAVY CHAIN DISEASE PROTEIN WIS IS SHOWN. 

CC -!- MISCELLANEOUS: THE SEQUENCE OF RESIDUES 42-76 WAS TAKEN FROM THE 

CC REF. 2.

CC -!- MISCELLANEOUS: DISEASE PROTEIN WIS IS LACKING MOST OF THE V REGION

CC AND ALL OF THE CH1 REGION.

CC -!- MISCELLANEOUS: DISEASE PROTEIN ZUC LACK MOST OF THE V REGION, ALL

CC OF THE CH1 REGION, AND PART OF THE HINGE COMPARED WITH NORMAL

CC GAMMA-3 HEAVY CHAINS.

CC -!- MISCELLANEOUS: DISEASE PROTEIN OMM MAY REPRESENT AN ALLELIC FORM

CC OR ANOTHER GAMMA CHAIN SUBCLASS.

CC -!- MISCELLANEOUS: THE HINGE REGION IN GAMMA-3 CHAINS IS ABOUT FOUR

CC TIMES AS LONG AS IN OTHER GAMMA CHAINS AND CONTAINS THREE

CC IDENTICAL 15-RESIDUE SEGMENTS PRECEDED BY A SIMILAR 17-RESIDUE

CC SEGMENT (12-28).

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; J00231; AAA52805.1; ALT_SEQ.

DR HSSP; P01857; 1FC1.

DR Genew; HGNC:5527; IGHG3.

DR MIM; 147120; -.

DR GO; GO:0005624; C:membrane fraction; NAS.

DR GO; GO:0003823; F:antigen binding activity; TAS.

DR GO; GO:0006955; P:immune response; NAS.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_cl.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 2.

DR SMART; SM00407; IGcl; 1.

DR PROSITE; PS50835; IG_LIKE; 2.

DR PROSITE; PS00290; IG_MHC; 1.

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; Repeat;

KW Pyrrolidone carboxylic acid.



FT DOMAIN 12 73 HINGE.

FT DOMAIN 74 183 CH2.










FT 


DOMAIN 
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289 


CH3 . 
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CARBOXYLIC 


ACID. 




FT CARBOHYD 6 6 N-LINKED (GLCNAC...).

FT DISULFID 7 7 INTERCHAIN (WITH HEAVY CHAIN DIMER)


FT 


DISULFID 


24 


24 


INTERCHAIN 
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FT 
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42 
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HEAVY 
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DIMER) 


FT 


DISULFID 


48 
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INTERCHAIN 
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HEAVY 
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DIMER) 


FT DISULFID 54 54 INTERCHAIN (WITH HEAVY CHAIN DIMER)


FT 


DISULFID 


57 


57 


INTERCHAIN 
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HEAVY 


CHAIN 


DIMER) 


FT 


DISULFID 


63 


63 


INTERCHAIN 
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HEAVY 


CHAIN 


DIMER) 


FT 


DISULFID 


69 


69 


INTERCHAIN 
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HEAVY 


CHAIN 


DIMER) 


FT DISULFID 72 72 INTERCHAIN (WITH HEAVY CHAIN DIMER)

FT CARBOHYD 140 140 N-LINKED (GLCNAC...).






FT 


MOD RES 
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REMOVED POST- TRANS LAT I ONALLY . 
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UAD T AMT 1 
V -rvTs. ± J-iiN 1 




9 97 
ZZ / 


D - > JM V 1 JN UlWJ J . 


FT 








/FTId=VAR_003894. 


FT VARIANT 221 227 MISSING (IN ZUC).

FT /FTId=VAR_003895.

FT VARIANT 279 279 F -> Y (IN OMM).

FT /FTId=VAR_003896.

SQ SEQUENCE 290 AA; 32331 MW; E69CBC95705B2F46 CRC64;



Query Match 91.0%; Score 1126; DB 1; Length 290; 

Best Local Similarity 90.3%; Pred. No. 9.6e-85; 

Matches 205; Conservative 13; Mismatches 9; Indels 0; Gaps 0; 
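
The percentages in the summary block above can be reproduced from the other figures the report prints. The sketch below is an inference from those figures, not a statement of how GenCore computes them: it assumes Query Match is the hit score divided by the perfect score (1238, from the run header), and Best Local Similarity is Matches divided by all aligned columns. The function names are hypothetical.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the query's perfect score (assumed relation)."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_percent(matches: int, conservative: int,
                                  mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all aligned columns (assumed relation)."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    # Figures copied from the summary block above; 1238 is the perfect score
    # given in the run header.
    print(query_match_percent(1126, 1238))
    print(best_local_similarity_percent(205, 13, 9, 0))
```

The same two relations also agree with the figures printed for RESULT 5 (score 918.5; 167, 29, 32 and 5), so they can serve as a check when a percentage in a scanned summary line is illegible.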



Qy 2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61

1 II IIIIIIIMIIIIIIMIIIIIMIIIIIIMIMIIIIIIIIIIM MM

Db 64 DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFKWYVD 123

Qy 62 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121

IhllMlllllhhIlhlllMIMIIhlhllMIIIIIIIIIIIIMIIMI 1

Db 124 GVQWNAKTKPREQQFNSTFRVVSVLTVLHQNWLDGKEYKCKVSNKALPAPIEKTISKTK 183

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181

1 1 1 1 II 1 M 1 1 1 1 MM 1 1 1 M II 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 H 1 1 1 1 II IIIIMM

Db 184 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 243

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228

[ 1 1 M 1 M 1 1 ! 1 1 1 1 1 1 M E 1 = 1 1 1 1 i 1 1 1 1 1 1 E MIMIIMIII

Db 244 DGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSPGK 290





RESULT 5 
GC_RABIT 

ID GC_RABIT STANDARD; PRT; 323 AA. 

AC P01870; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma chain C region. 

OS Oryctolagus cuniculus (Rabbit).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.

OX NCBI_TaxID=9986;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=84030930; PubMed=6313520;

RA Bernstein K.E., Alexander C.B., Mage R.G.; 

RT "Nucleotide sequence of a rabbit IgG heavy chain from the recombinant 

RT F-I haplotype."; 

RL Immunogenetics 18:387-397(1983). 

RN [2] 



RP SEQUENCE OF 1-128. 

RX MEDLINE=76135469; PubMed=1243651 ; 

RA Pratt D.M., Mole L.E.; 

RT "Sequence studies on the constant region of the Fd sections of rabbit 

RT immunoglobulin G of different allotype."; 

RL Biochem. J. 151:337-349(1975). 

RN [3] 

RP SEQUENCE OF 88-266 FROM N.A.

RX MEDLINE=83299917; PubMed=6193512;

RA Martens C.L., Moore K.W., Steinmetz M., Hood L., Knight K.L.;

RT "Heavy chain genes of rabbit IgG: isolation of a cDNA encoding gamma 

RT heavy chain and identification of two genomic C gamma genes."; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:6018-6022(1982). 

RN [4] 

RP SEQUENCE OF 132-161. 

RX MEDLINE=70110015; PubMed=5461106;

RA Fruchter R.G., Jackson S.A., Mole L.E., Porter R.R.;

RT "Sequence studies of the Fd section of the heavy chain of rabbit

RT immunoglobulin G.";

RL Biochem. J. 116:249-259(1970).

RN [5]

RP SEQUENCE OF 129-131 AND 155-322.

RA Hill R.L., Lebovitz H.E., Fellows R.E. Jr., Delaney R.;

RL (In) Killander J. (eds.);

RL Gamma globulins, Nobel symp. 3, pp. 109-127, Almqvist and Wiksell,

RL Stockholm (1967).

CC -!- MISCELLANEOUS: REF. 1 SEQUENCE HAS THE D12 ALLOTYPIC MARKER,

CC 104-THR, AND THE E14 MARKER, 185-THR. REF. 3 HAS THE D11 AND E15

CC MARKERS AND REF. 5 THE E15 MARKER.

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; M16426; AAA31289.1; -.

DR PIR; A91749; GHRB.

DR HSSP; P01857; 1FC1.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 2.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Repeat. 

FT NON_TER 1 1 

FT DOMAIN 6 96 IG-LIKE 1. 

FT DOMAIN 114 213 IG-LIKE 2. 

FT DOMAIN 222 318 IG-LIKE 3. 

FT VARIANT 104 104 T -> M (IN D11 MARKER).

FT VARIANT 185 185 T -> A (IN E15 MARKER)

FT CONFLICT 48 48 N -> E (IN REF. 2).

FT REF.

FT REF. 5).

FT 260 D REF. 5).

FT CONFLICT 266 -> (IN REF. 5).

FT CONFLICT 280 280 -> W (IN REF. 5).

FT CONFLICT 284 N -> S (IN REF. 5).

SQ SEQUENCE 323 AA; 35404 MW; 69E8AA118D579A8B CRC64;

Query Match 74.2%; Score 918.5; DB 1; Length 323;

Best Local Similarity 71.7%; Pred. No. 8.9e-68;

Matches 167; Conservative 29; Mismatches 32; Indels 5; Gaps 2;



Qy 1 MDKT---HTC--PPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVK 55

= 111 II I II I MM I I. M II' I I! M :||||:

Db 91

Qy 56 FNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK 115

I I :| h ||:|:||| | | | | | : III |||:|||| MM II

Db 151 FTWYINNEQVRTARPPLREQQFNSTIRVVSTLPITHQDWLRGKEFKCKVHNKALPAPIEK 210

Qy 116 TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTT 175

Mllhlll HI- I I : I I : llllh: I , ||: |:|||||

Db 211 TISKARGQPLEPKVYTMGPPREELSSRSVSLTCMINGFYPSDISVEWEKNGKAEDNYKTT 270

Qy 176

I llllllhllhlhl I Ihhlhlllllllllllllllhl I I I I

Db 271



RESULT 6 
GC2_CAVPO 

ID GC2_CAVPO STANDARD; PRT; 329 AA. 

AC P01862; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-2 chain C region.

OS Cavia porcellus (Guinea pig).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Hystricognathi; Caviidae; Cavia.

OX NCBI_TaxID=10141;

RN [1] 

RP SEQUENCE OF 1-3. 

RA Trischmann T.M.; 

RL Submitted (APR-1975) to the PIR data bank. 

RN [2] 

RP SEQUENCE OF 4-68. 

RX MEDLINE=71058471; PubMed=5538606;

RA Birshtein B.K., Hussain Q.Z., Cebra J.J.;

RT "Structure of heavy chain from strain 13 guinea pig

RT immunoglobulin-G(2). 3. Amino acid sequence of the region around the

RT half-cystine joining heavy and light chains.";

RL Biochemistry 10:18-25(1971).

RN [3] 

RP SEQUENCE OF 69-133 AND 312-329. 

RX MEDLINE=71058486; PubMed=5538616;

RA Turner K.J., Cebra J.J.;

RT "Structure of heavy chain from strain 13 guinea pig

RT immunoglobulin-G(2). II. Amino acid sequence of the carboxyl-terminal

RT and hinge region cyanogen bromide fragments."; 

RL Biochemistry 10:9-17(1971). 

RN [4] 

RP SEQUENCE OF 134-226. 

RX MEDLINE=75036072; PubMed=4429665;

RA Tracey D.E., Cebra J.J.;

RT "Primary structure of the CH2 homology region from guinea pig IgG2 

RT antibodies."; 

RL Biochemistry 13:4796-4803(1974). 

RN [5] 

RP SEQUENCE OF 227-311. 

RX MEDLINE=75036073; PubMed=4609467;

RA Trischmann T.M., Cebra J.J.;

RT "Primary structure of the CH3 homology region from guinea pig IgG2

RT antibodies.";

RL Biochemistry 13:4804-4811(1974).

RN [6]

RP DISULFIDE BONDS.

RX MEDLINE=71058474; PubMed=4922544;

RA Oliveira B., Lamm M.E.;

RT "Interchain disulfide bridges of guinea pig gamma-2-immunoglobulin.";

RL Biochemistry 10:26-31(1971). 

CC -!- MISCELLANEOUS: THIS CHAIN WAS ISOLATED FROM POOLED SERUM OF STRAIN

CC 13 INBRED GUINEA PIGS.

DR PIR; A94553; G2GP.

DR HSSP; P01842; 7FAB.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 2.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 1.

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein.

FT NON_TER 1 1

FT DISULFID 16 16 INTERCHAIN (WITH A LIGHT CHAIN).

FT DISULFID 28 79

FT DISULFID 105 105 INTERCHAIN (WITH A HEAVY CHAIN).

FT DISULFID 107 107 INTERCHAIN (WITH A HEAVY CHAIN).

FT DISULFID 110 110 INTERCHAIN (WITH A HEAVY CHAIN).

FT DISULFID 142 202

FT CARBOHYD 178 178 N-LINKED (GLCNAC...).

FT DISULFID 248 308

SQ SEQUENCE 329 AA; 36074 MW; 5D231B7164D1FBA9 CRC64;



Query Match 71.8%; Score 889; DB 1; Length 329; 

Best Local Similarity 72.3%; Pred. No. 2.3e-65; 

Matches 162; Conservative 24; Mismatches 36; Indels 2; Gaps 1; 



Qy 6 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEV 65

III II M llllllhllllllllllll II lllllllll HIM hll I

Db 106 TCPKCPPPENLGGPSVFIFPPKPKDTLMISLTPRVTCVVVDVSQDEPEVQFTWFVDNKPV 165

Qy 66 HNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 125

Ihllll 1 III : Mill llhllll MIIIMIIIIMI II II

Db 166 GNAETKPRVEQYNTTFRVESVLPIQHQDWLRGKEFKCKVYNKALPAPIEKTISKTKGAPR 225

Qy 126 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP--ENNYKTTPPVLDSDG 183

I ' hhll III II I I II Mh hll

Db 226 MPDVYTLPPSRDELSKSKVSVTCLIINFFPADTHVEWASNRVPVSEKEYKNTPPIEDADG 285

Qy 184 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227

hlllllllllll I II h:|||||||||ll Ilh:| III

Db 286 SYFLYSKLTVDKSAWDQGTVYTCSVMHEALHNHVTQKAISRSPG 329



RESULT 7 
GC3_MOUSE 

ID GC3_MOUSE STANDARD; PRT; 329 AA.

AC P22436; 

DT 01-AUG-1991 (Rel . 19, Created) 

DT 01-AUG-1991 (Rel. 19, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Ig gamma-3 chain C region, secreted form. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=85027161; PubMed=6092053;

RA Wels J.A., Word C.J., Rimm D., Der-Balan G.P., Martinez H.M.,

RA Tucker P.W., Blattner F.R.; 

RT "Structural analysis of the murine IgG3 constant region gene."; 

RL EMBO J. 3:2041-2046(1984). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; J00451; -; NOT_ANNOTATED_CDS.

DR PIR; B02156; G3MSC.

DR HSSP; P01857; 1FC1.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 3.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 1.

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein;

KW Transmembrane; Alternative splicing.

FT NON_TER 1 1

FT DOMAIN 1 97 CH1.

FT DOMAIN 98 113 HINGE.

FT DOMAIN 114 223 CH2.

FT DOMAIN 224 327 CH3.

SQ SEQUENCE 329 AA; 36228 MW; F45827174182BAD6 CRC64;

Query Match 68.3%; Score 845.5; DB 1; Length 329; 

Best Local Similarity 67.0%; Pred. No. 8.1e-62; 

Matches 150; Conservative 34; Mismatches 37; Indels 3; Gaps 1; 

Qy 8 PP---CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64

II II hlllllhlllllll I I II Ihlllllllll i :|:|| I 

Db 106 PPGSSCPPGNILGGPSVFIFPPKPKDALMISLTPKVTCVVVDVSEDDPDVHVSWFVDNKE 165

Qy 65 VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124

II I hill I : lllh llhlllhllllllllhllll I 

Db 166 VHTAWTQPREAQYNSTFRVVSALPIQHQDWMRGKEFKCKVNNKALPAPIERTISKPKGRA 225

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

: Mllhll h::M hllllll h hill I I hi llhllllh 

Db 226 QTPQVYTIPPPREQMSKKKVSLTCLVTNFFSEAISVEWERNGELEQDYKNTPPILDSDGT 285 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

hllllllll I II , Hi I m. 1 1 1 1 

Db 286 YFLYSKLTVDTDSWLQGEIFTCSVVHEALHNHHTQKNLSRSPGK 329



RESULT 8 
GCB_RAT 

ID GCB_RAT STANDARD; PRT; 333 AA. 

AC P20761; 

DT 01-FEB-1991 (Rel . 17, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-2B chain C region.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89232738; PubMed=3149946;

RA Brueggemann M.;

RT "Evolution of the rat immunoglobulin gamma heavy-chain gene family.";

RL Gene 74:473-482(1988).

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains.

DR PIR; PS0018; PS0018.

DR HSSP; P01842; 7FAB.

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 3.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 1.
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HEAVY CHAIN) 


FT DISULFID 253 311

SQ SEQUENCE 333 AA; 36497 MW; 55F8B64D48D460A6 CRC64;

Query Match 67.7%; Score 838; DB 1; Length 333;

Best Local Similarity 66.4%; Pred. No. 3.3e-61;

Matches 148; Conservative 33; Mismatches 42; Indels 0; Gaps 0;



Qy 6 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEV 65

II II lllllllllhlllllll I : I I : II Ml I |:|:|:|:|:|: |||

Db 111

Qy 66 HNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 125

I IMMIIIIIMIMIII I : MlhMIIMMIMIIIIMIIIIIII II I

Db 171 HTAQTQPREEQYNSTFRVVSALPIQHQDWMSGKEFKCKVNNKALPSPIEKTISKPKGLVR 230

Qy 126 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 185

HIM : I i MINI II II III Ml I III I 1,1

Db 231 KPQVTVTVIGPPTEQLTEQTVSLTCLTSGFLPNDIGVEWTSNGHIEKNYKNTEPVMDSDGSF 290

Qy 186

Mill III I MIMI III M III

Db 291



RESULT 9 
GC3M_MOUSE 

ID GC3M_MOUSE STANDARD; PRT; 398 AA.

AC P03987;

DT 23-OCT-1986 (Rel. 02, Created)

DT 01-AUG-1991 (Rel. 19, Last sequence update)

DT 15-JUL-1999 (Rel. 38, Last annotation update)

DE Ig gamma-3 chain C region, membrane-bound form.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=85027161; PubMed=6092053;

RA Wels J.A., Word C.J., Rimm D., Der-Balan G.P., Martinez H.M.,

RA Tucker P.W., Blattner F.R.;

RT "Structural analysis of the murine IgG3 constant region gene.";

RL EMBO J. 3:2041-2046(1984).

RN [2]

RP SEQUENCE OF 328-398 FROM N.A.

RX MEDLINE=84041483; PubMed=6314258;

RA Komaromy M., Clayton L., Rogers J., Robertson S., Kettman J.,

RA Wall R.;

RT "The structure of the mouse immunoglobulin in gamma 3 membrane gene 

RT segment."; 

RL Nucleic Acids Res. 11:6775-6785(1983). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; J00451; AAB59655.1; -.

DR EMBL; V01526; CAA24767.1; ALT_SEQ.

DR PIR; A02156; G3MSM.

DR HSSP; P01857; 1FC1.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 3.

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; 

KW Transmembrane; Alternative splicing.

362 POTENTIAL.

FT DOMAIN CYTOPLASMIC (POTENTIAL).

FT CONFLICT 333 333 E -> G (IN REF. 2).

FT CONFLICT 342 342 E -> Q (IN REF. 2).

FT CONFLICT 388 388 P -> F (IN REF. 2).

SQ SEQUENCE 398 AA; 43929 MW; CF7F264B50A41B95 CRC64;

Query Match 67.4%; Score 834.5; DB 1; Length 398;

Best Local Similarity 66.7%; Pred. No. 8e-61;

Matches 148; Conservative 34; Mismatches 37; Indels 3; Gaps 1;

Qy 8 PP---CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64

II II MMMIIMIIMII 1 1 1 1 IhlMIIMII Mhl MMI I 

Db 106 PPGSSCPPGNILGGPSVFIFPPKPKDALMISLTPKVTCVVVDVSEDDPDVHVSWFVDNKE 165

Qy 65 VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124

M I hill llllhllll I : IMh llhlllhllllllllhllll i 
Db 166 VHTAWTQPREAQYNSTFRVVSALPIQHQDWMRGKEFKCKVNNKALPAPIERTISKPKGRA 225

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

: I I I I I I I h — l 'MM \ II M= I Ml llhllllh 

Db 226 QTPQVYTIPPPREQMSKKKVSLTCLVTNFFSEAISVEWERNGELEQDYKNTPPILDSDGT 285 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 226 



:|IMIIIII I II : Ml M II 1 1 .M IM I II 

Db 286 YFLYSKLTVDTDSWLQGEIFTCSVVHEALHNHHTQKNLSRSP 327

RESULT 10 
GC1_RAT 

ID GC1_RAT STANDARD; PRT; 326 AA. 

AC P20759; 

DT 01-FEB-1991 (Rel . 17, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-1 chain C region.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=89232738; PubMed=3149946;

RA Brueggemann M.;

RT "Evolution of the rat immunoglobulin gamma heavy-chain gene family.";

RL Gene 74:473-482(1988).










DR PIR; PS0017; PS0017.

DR HSSP; P01842; 7FAB.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 3.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 1.

N-LINKED (GLCNAC...) (POTENTIAL)

SQ SEQUENCE 326 AA; 35946 MW; 013BAB45EF49B9DA CRC64;

Query Match 66.1%; Score 818.5; DB 1; Length 326;

Best Local Similarity 63.1%; Pred. No. 1.3e-59;





Matches 142; Conservative 41; Mismatches 35; Indels 7; Gaps 2; 

Qy 7 CPPCPAPELLGG---PSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGV 63

I M = I Mil | |: Ihllllllhl :|||| h|:|| | 

Db 106 CKPC----ICTGSEVSSVFIFPPKPKDVLTITLTPKVTCVVVDISQDDPEVHFSWFVDDV 161

Qy 64 EVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQ 123

III hhl llhllhl II I Hllllllh I I hlllMMI , 



Db 162 EVHTAQTRPPEEQFNSTFRSVSELPILHQDWLNGRTFRCKVTSAAFPSPIEKTISKPEGR 221



Qy 124 PREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG 183 

: I M. h-hhHhlhllllll II llh 1111= III III :hll 

Db 222 TQVPHVYTMSPTKEEMTQNEVSITCMVKGFYPPDIYVEWQMNGQPQENYKNTPPTMDTDG 281 

Qy 184 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I | | :||||| |:|||:|| Mlhhllll | | | | 
Db 282 SYFLYSKLNVKKEKWQQGNTFTCSVLHEGLHNHHTEKSLSHSPGK 326 



RESULT 11 
GC1_MOUSE

ID GC1_MOUSE STANDARD; PRT; 324 AA.

AC P01868; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-1 chain C region secreted form.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=80045036; PubMed=115593;

RA Honjo T., Obata M., Yamawaki-Kataoka Y., Kataoka T., Kawakami T.,

RA Takahashi N., Mano Y.;

RT "Cloning and complete nucleotide sequence of mouse immunoglobulin 

RT gamma 1 chain gene."; 

RL Cell 18:559-568(1979). 

RN [2] 

RP SEQUENCE OF 76-324 FROM N.A. (MYELOMA PROTEIN MOPC 31C).

RX MEDLINE=80202559; PubMed=6769752;

RA Obata M., Yamawaki-Kataoka Y., Takahashi N., Kataoka T., Shimizu A.,

RA Mano Y., Seidman J.G., Peterlin B.M., Leder P., Honjo T.;

RT "Immunoglobulin gamma 1 heavy chain gene: structural gene sequences

RT cloned in a bacterial plasmid.";

RL Gene 9:87-97(1980).

RN [3]

RP SEQUENCE OF -70-322 FROM N.A. (MYELOMA PROTEIN MOPC 21).

RX MEDLINE=80012837; PubMed=113776;

RA Rogers J., Clarke P., Salser W.;

RT "Sequence analysis of cloned cDNA encoding part of an immunoglobulin

RT heavy chain.";

RL Nucleic Acids Res. 6:3305-3321(1979).

RN [4]

RP SEQUENCE (MYELOMA PROTEIN MOPC 21).

RX MEDLINE=78242288; PubMed=98524;

RA Adetugbo K.;

RT "Evolution of immunoglobulin subclasses. Primary structure of a

RT murine myeloma gamma1 chain.";

RL J. Biol. Chem. 253:6068-6075(1978).

RN [5]

RP DISULFIDE BONDS (MOPC 21).

RX MEDLINE=73008889; PubMed=5073237;

RA Svasti J., Milstein C.;

RT "The disulphide bridges of a mouse immunoglobulin G1 protein.";

RL Biochem. J. 126:837-850(1972). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=Secreted;

CC IsoId=P01868-1; Sequence=Displayed;

CC Note=May be the major isoform;

CC Name=Membrane-bound;

CC IsoId=P01869-1; Sequence=External;

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).



DR EMBL; V00793; CAA24172.1;

DR EMBL; V00793; CAA24173.1;

DR EMBL; V00793; CAA24174.1;

DR EMBL; V00793; CAA24175.1;

DR EMBL; V00795; CAA24176.1;


DR PIR; A02159; G1MS. 
DR PDB; 1IGC; 03-JUN-95. 
DR GlycoSuiteDB; P01868; -. 



DR MGD; MGI:96446; Igh-4.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 3.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 1.








KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein;

KW Alternative splicing; 3D-structure.

FT /FTId=CAR_000055.

FT DISULFID

FT MOD_RES REMOVED POST-TRANSLATIONALLY.

FT CONFLICT 276 N -> D (IN REF. 3).

FT CONFLICT 278 N -> D (IN REF. 3).

SQ SEQUENCE 324 AA; 35704 MW; A338812F3D1F2C93 CRC64;


Query Match 66.0%; Score 817.5; DB 1; Length 324;



Best Local Similarity 62.5%; Pred. No. 1.5e-59; 

Matches 140; Conservative 44; Mismatches 35; Indels 5; Gaps 2; 



Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64

llllh M Ml. II I I I Ihllllllhl :|||h|:hll II 

Db 104 CKPCICTVPEV---SSVFIFPPKPKDVLTITLTPKVTCVVVDISKDDPEVQFSWFVDDVE 160

Qy 65 VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124

II hhlillhllhl II I ::||||llllhlh|:: I llllllllll Ihl 
Db 161 VHTAQTQPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 220

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184

: llllhll :::: I — I I I I I — hi II llh I I I I III I ^ M 

Db 221 KAPQVYTIPPPKEQMAKDKVSLTCMITDFFPEDITVEWQWNGQPAENYKNTQPIMNTNGS 280 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

= 1=1111 I II 1 = II I lllhhllll I I I I 

Db 281 YFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKSLSHSPGK 324 



RESULT 12 
GC1M_MOUSE

ID GC1M_MOUSE STANDARD; PRT; 393 AA.

AC P01869;

DT 21-JUL-1986 (Rel. 01, Created)

DT 01-AUG-1991 (Rel. 19, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Ig gamma-1 chain C region, membrane-bound form.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A.

RX MEDLINE=80045036; PubMed=115593;

RA Honjo T., Obata M., Yamawaki-Kataoka Y., Kataoka T., Kawakami T.,

RA Takahashi N., Mano Y.;

RT "Cloning and complete nucleotide sequence of mouse immunoglobulin 

RT gamma 1 chain gene."; 

RL Cell 18:559-568(1979). 

RN [2]

RP SEQUENCE OF 323-393 FROM N.A.

RX MEDLINE=82197626; PubMed=6804950;

RA Tyler B.M., Cowman A.F., Gerondakis S.D., Adams J.M., Bernard O.;

RT "mRNA for surface immunoglobulin gamma chains encodes a highly

RT conserved transmembrane sequence and a 28-residue intracellular

RT domain.";

RL Proc. Natl. Acad. Sci. U.S.A. 79:2008-2012(1982).

RN [3]

RP SEQUENCE OF 323-366 FROM N.A.

RX MEDLINE=82115295; PubMed=6799207;

RA Rogers J., Choi E., Souza L., Carter C., Word C.J., Kuehl M.,

RA Eisenberg D., Wall R.;

RT "Gene segments encoding transmembrane carboxyl termini of

RT immunoglobulin gamma chains.";

RL Cell 26:19-27(1981).

RN [4]
RP SEQUENCE OF 1-44 FROM N.A.
RX MEDLINE=82222190; PubMed=6283537;
RA Yamawaki-Kataoka Y., Nakai S., Miyata T., Honjo T.;
RT "Nucleotide sequences of gene segments encoding membrane domains of
RT immunoglobulin gamma chains.";
RL Proc. Natl. Acad. Sci. U.S.A. 79:2623-2627(1982).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=Membrane-bound;
CC IsoId=P01869-1; Sequence=Displayed;
CC Name=Secreted;
CC IsoId=P01868-1; Sequence=External;
CC Note=May be the major isoform;
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; V00793; CAA24172.1
DR EMBL; V00793; CAA24173.1
DR EMBL; V00793; CAA24174.1
DR PIR; B02159; G1MSM.
DR PDB; 15C8; 23-MAR-99.
DR PDB; 1AE6; 18-MAR-98.
DR PDB; 1CL7; 12-JAN-00.
DR PDB; 1F11; 06-FEB-01.
DR PDB; 1F58; 29-DEC-99.
DR PDB; 1KC5; 24-JUL-02.
DR PDB; 1KCR; 11-MAY-02.
DR PDB; 25C8; 09-JUL-99.
DR MGD; MGI:96446; Igh-4.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003597; Ig_c1.
DR InterPro; IPR003006; Ig_MHC.
DR Pfam; PF00047; ig; 3.
DR SMART; SM00407; IGc1; 2.
DR PROSITE; PS50835; IG_LIKE; 3.
DR PROSITE; PS00290; IG_MHC; 1.
KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein;
KW Alternative splicing; Transmembrane; 3D-structure.
FT TRANSMEM 340 357 POTENTIAL.
FT DOMAIN 358 393 CYTOPLASMIC (POTENTIAL).
SQ SEQUENCE 393 AA; 43386 MW; 4CC88343B7A1CE27 CRC64;



Query Match 65.6%; Score 812.5; DB 1; Length 393; 

Best Local Similarity 62.3%; Pred. No. 4.9e-59; 

Matches 139; Conservative 44; Mismatches 35; Indels 5; Gaps 2;

Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64

I I I 11= Mhlllllll I h Ihllllllhl ■ I:.-,::' || 

Db 104 CKPCICTVPEV---SSVFIFPPKPKDVLTITLTPKVTCVVVDISKDDPEVQFSWFVDDVE 160

Qy 65 VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124

II hhllllhllhl II I - I I- - | |||||||||| 

Db 161 VHTAQTQPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 220 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184

: Mllhll I - I I I I I : = hi II Mh I I II III I h^-ll 

Db 221 KAPQVYTIPPPKEQMAKDKVSLTCMITDFFPEDITVEWQWNGQPAENYKNTQPIMNTNGS 280 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227 

:hllll I II h II hllhll lllhhilll III 
Db 281 YFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKSLSHSPG 323 



RESULT 13 
GCC_RAT 

ID GCC_RAT STANDARD; PRT; 329 AA.

AC P20762; 

DT 01-FEB-1991 (Rel . 17, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-2C chain C region. 

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=88166903; PubMed=3127222;

RA Brueggemann M., Delmastro-Galfre P., Waldmann H., Calabi F.;

RT "Sequence of a rat immunoglobulin gamma 2c heavy chain constant

RT region cDNA: extensive homology to mouse gamma 3.";

RL Eur. J. Immunol. 18:317-319(1988).

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; X07189; CAA30169.1; -.

DR PIR; S00847; S00847.

DR HSSP; P01842; 7FAB.

DR InterPro; IPR007110; Ig-like. 



DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 2.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 1.

KW Immunoglobulin domain; Immunoglobulin C region.

SQ SEQUENCE 329 AA; 36571 MW; 5FCD7B7933850773 CRC64;

Query Match 65.4%; Score 809.5; DB 1; Length 329;



Best Local Similarity 63.8%; Pred. No. 6.9e-59; 

Matches 143; Conservative 41; Mismatches 37; Indels 3; Gaps 1; 



Qy 8 PP---CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64

M I : I I lllhlllllll llh Ihlllllllll hhhhhil I 
Db 106 PPTDICSCDDNLGRPSVFIFPPKPKDILMITLTPKVTCVWDVSEEEPDVQFSWFVDNVR 165 

Qy 65 VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

I I Ml I hllll I : Mlh:|lhlMhll Ihllllllll ' 

Db 166 VFTAQTQPHEEQLNGTFRWSTLHIQHQDWMSGKEFKCKVNNKDLPSPIEKTISKPRGKA 225 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

I llllhll |::::|hllllhl llh hllll I I :|| | III I 
Db 226 RTPQVYTI PPPREQMSKNKVSLTCMVTSFYPAS I SVEWERNGELEQDYKNTLPVLDSDES 285 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

HI I -I : Mi |: INI | | | | 

Db 286 YFLYSKLSVDTDSWMRGDI YTCSWHEALHNHHTQKNLSRSPGK 32 9 

RESULT 14 
GCAA_MOUSE 

ID GCAA_MOUSE STANDARD; PRT; 330 AA.

AC P01863; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-2A chain C region, A allele. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=81076554; PubMed=6777755;

RA Sikorav J.-L., Auffray C., Rougeon F.;

RT "Structure of the constant and 3' untranslated regions of the murine

RT Balb/c gamma 2a heavy chain messenger RNA.";

RL Nucleic Acids Res. 8:3143-3155(1980). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=81198976; PubMed=6262729;

RA Yamawaki-Kataoka Y., Miyata T., Honjo T.;

RT "The complete nucleotide sequence of mouse immunoglobin gamma 2a gene 

RT and evolution of heavy chain genes: further evidence for intervening 

RT sequence-mediated domain transfer."; 

RL Nucleic Acids Res. 9:1365-1381(1981). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=81223894; PubMed=6787604;

RA Ollo R., Auffray C., Morchamps C., Rougeon F.;

RT "Comparison of mouse immunoglobulin gamma 2a and gamma 2b chain genes 

RT suggests that exons can be exchanged between genes in a multigenic 

RT family."; 

RL Proc. Natl. Acad. Sci. U.S.A. 78:2442-2446(1981). 

RN [4] 

RP MYELOMA PROTEIN MOPC 173. 

RX MEDLINE=74175517; PubMed=4831970;

RA Bourgois A., Fougereau M., Rocca-Serra J.;

RT "Determination of the primary structure of a mouse IgG2a 

RT immunoglobulin:amino-acid sequence of the Fc fragment. Implications 

RT for the evolution of immunoglobulin structure and function."; 

RL Eur. J. Biochem. 43:423-435(1974). 

RN [5] 

RP DISULFIDE BONDS. 

RX MEDLINE=73056887; PubMed=4565406;

RA de Preval C., Fougereau M.;

RT "Determination of the primary structure of a mouse gamma G2a 

RT immunoglobulin. Identification of the disulfide bridges."; 

RL Eur. J. Biochem. 30:452-462(1972). 

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; V00798; CAA24178.1;

DR PIR; A02152; G2MSA.

DR PDB; 1E4W; 12-JUL-01.

DR PDB; 1E4X; 12-JUL-01.

DR PDB; 1MNU; 06-MAY-99.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 2.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3.

DR PROSITE; PS00290; IG_MHC; 1.

KW Immunoglobulin domain; Immunoglobulin C region; 3D-structure; Repeat.

FT MOD_RES 330 330 REMOVED POST-TRANSLATIONALLY.

SQ SEQUENCE 330 AA; 36389 MW; B84361C5445A6864 CRC64;

Query Match 65.3%; Score 809; DB 1; Length 330;

Best Local Similarity 65.2%; Pred. No. 7.6e-59;

Matches 146; Conservative 30; Mismatches 46; Indels 2; Gaps 1;



Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64

Ml I I I I lllllllhllll II I I I I H MINIMI : I i : i : :hh II 
Db 107 CPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVE 166

Qy 65 VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124

II hh I MM MM I : MlhMlhlMhll MlllhllM II 

Db 167 VHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSV 226

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184

I MM III MMI Ihllhl I I II Ml Mh I III I i I I I I I 1 I 
Db 227 RAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGS 286

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228

MMIM hi I : I Ml- MUM II I MM 

Db 287 YFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK 330



RESULT 15 
GCAM_MOUSE 

ID GCAM_MOUSE STANDARD; PRT; 399 AA.

AC P01865;

DT 21-JUL-1986 (Rel. 01, Created)

DT 01-AUG-1991 (Rel. 19, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Ig gamma-2A chain C region, membrane-bound form.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A.

RX MEDLINE=82222190; PubMed=6283537;

RA Yamawaki-Kataoka Y., Nakai S., Miyata T., Honjo T.;

RT "Nucleotide sequences of gene segments encoding membrane domains of 

RT immunoglobulin gamma chains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:2623-2627(1982). 

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=Membrane-bound;

CC IsoId=P01865-1; Sequence=Displayed;

CC Name=Secreted;

CC IsoId=P01864-1; Sequence=External;

CC Note=Probably the major isoform;

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; J00471; AAB59661.1; ALT_INIT. 

DR PIR; A02154; G2MSAM. 

DR PDB; 1KB5; 08-APR-98.

DR PDB; 1YEE; 15-OCT-97.

DR MGD; MGI:96443; Igh-1.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003597; Ig_c1.

DR InterPro; IPR003006; Ig_MHC.

DR Pfam; PF00047; ig; 2.

DR SMART; SM00407; IGc1; 2.

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; 

KW Transmembrane; Alternative splicing; 3D-structure; Repeat.

SQ SEQUENCE 399 AA; 44020 MW; 4C38138BFAED3FF0 CRC64;



Query Match 64.9%; Score 804; DB 1; Length 399; 

Best Local Similarity 65.0%; Pred. No. 2.5e-58; 

Matches 145; Conservative 30; Mismatches 46; Indels 2; Gaps 1; 

Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64

Ml I I I I IIIIIIIMIII II I I I I I I Mill :|hh :|:|: II 
Db 107 CPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVE 166

Qy 65 VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124

II hi: I M I I I I I I I : I I I h = I I I = I I I h I I lllllhllll II 
Db 167 VHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSV 226

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184

I 1 1 1 1 III Ihllhl I I II III i I IN I llllllll 

Db 227 RAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGS 286

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227

= 1 = 1111 |:| | : | :||||:|| MM I | :|| 
Db 287 YFMYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPG 329



Search completed: January 16, 2004, 09:36:59 
Job time : 18 secs
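
Note on the reported percentages: the "Query Match" and "Best Local Similarity" values in the hits above are numerically consistent with two simple ratios. This is a sketch under stated assumptions, not documented GenCore behavior: it assumes Query Match is the hit score divided by the perfect score (1238, from the run header), and that Best Local Similarity is the match count divided by the number of aligned columns (Matches + Conservative + Mismatches + Indels). The function names are hypothetical; the input figures are copied from RESULT 14, RESULT 15, and the preceding hit.

```python
# Assumed relationships (not confirmed by the listing itself):
#   Query Match           = 100 * score / perfect_score
#   Best Local Similarity = 100 * matches / aligned columns

PERFECT_SCORE = 1238  # "Perfect score" from the run header


def query_match_pct(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the perfect (self-match) score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all aligned columns."""
    aligned_columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / aligned_columns, 1)


# (label, score, matches, conservative, mismatches, indels) from the listing
hits = [
    ("Ig gamma-2C, rat (329 AA)", 809.5, 143, 41, 37, 3),
    ("GCAA_MOUSE", 809, 146, 30, 46, 2),
    ("GCAM_MOUSE", 804, 145, 30, 46, 2),
]

for label, score, m, c, x, i in hits:
    print(label, query_match_pct(score), best_local_similarity_pct(m, c, x, i))
```

Under these assumptions the computed values agree with the figures printed in the listing for all three hits, which is also a useful cross-check when the OCR of a statistics line is in doubt.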



